FIGURE i 



o 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGT C T GAAGAC T AACAT T T T GT GAAGTT GT AAAACAGAAAACC T GT T AGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T CAT AC AT T AC T GCAGTAACACTCCACCAT ATAGACCC GGC T T TACC T TATATCAG TGACAC T GG 
TACAGT AGC TCCAGAAAAAT GC T TAT T T GGGGCAAT GC T AAATAT TGC GGCAGT T T TAT GC AT TG 
C TACCATT TATGTT CGTTATAAGCAAGTT CATGCTCTGAGTCCTGAAGAGAACGTTATCAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGT T TGTTCAGACCATCCT TT CCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 



W ATCAGT TTTGCACAGTGGCAAT TT TGGGACTGAT TTAGAACAGAAACT CCAT TGGAACCCCGAGG 

SI 

ACAAAG G T TAT G T GC T T CACAT GAT C AC T AC T GCAG C AG AAT GG TC TATG T CAT TTTCCTTCTTT 

14 

: s i GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 

s TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 

CJ GAGATAT T TGAT GAAAGGATAAAATAT T T CT GTAATGATTAT GATTCT CAGGGAT T GGGGAAAGG 

Vj TTCACAGAAGT TGCT TATTCT TCTCTGAAAT TT TCAACCACTTAATCAAGGCT GACAGTAACACT 

H GATGAATGCT GATAATCAGGAAACAT GAAAGAAGCCAT T T GAT AGATT AT TC TAAAGGATAT CAT 

S5 CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT 

HI ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 

LCIAT I YVRYKQVHALSPEENVI IKLNKAGLVLGI LSCLGLS IVANFQKTTLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMI TTAAEWSMS FS FFGFFLTYIRDFQKI SLRVEANLHGLTLYDTAPCP INNERTR 
LLSRDI 

U 

*5?r 3 

ftl Important features: 

f|| Type II transmembrane domain: 

III amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites . 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 

m 



m 
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FIGURE 3 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACGTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAAGAACGA 
. . CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
O CAT TCT CAGCCCAAAACATGGGATCTT ATCCATAGAACAGCTCAT CAGCCGGGT TGGTGT GATT GGAG 

CI TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
f\ CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 
g T AAC CAT CAGGT T TCT GGGGAAT GAT AAAAAGT G T TAC CACT T C AG CAT CAGGAAG TGAAAAT C T T AC T 

%l CTTATTCAACAGGAAGT GGAT GCTT T GGAAGAATTAAGCAGGCAGCT T T TT CT GGAAACAGCTGATCT 

gt ATAT GCT AC CAAGGAGAGAATAGAATACT CCAAAACCTTCAAGGGGAAATATTTT AATT TT CT T GGTT 

E| ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
Wl G GGAAAA.CGGAT CCTGTCACAAGAGGCAT TGAGAT CACT GTGAATTAT CTGGGAATCCAAT TTGATGT 

g GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

|Sf TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

in I 

f|V CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
T GTAGAGGCGGAGAGGAGCCAAGAAAC TAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTT GAGATTGACTCATTAAAAT CAGAGACTGTAACAAAAAAZy\AAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE ^ 

MS FL I DS S IMI T SQ I LFFG FGWL FFMRQL FKDYE IRQYWQVI FS VT FAFS CTMFEL 1 1 FE I LGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FP I LS PKHG I LS IEQL I SRVGVI GVTLMALLSGFGAVNCPYT YMS YFLRNVTDTDI LALERRLLQ 
TMDMI I SKKKRMAMARRTMFQKGE VHNKPS G FWGM IKS VTT S AS GS ENLTL I QQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKI FMATINIVFDRVGKTDPVTRGIEI 
T VNYL G I Q FDVK FWS QH I S F I LVG 1 1 1 VT S I RGLL I T LTKFFYAI S S S KS S NV I VLLLAQ IMGM Y 
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102," 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE 5 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGGAGTGAGTGTGAGCCTCAACATAGTTCC 

AGAAC T C T C CAT C C GG AC TAG T TAT T GAG CAT CTGCCTCT CAT AT C AC C AG T G GC CAT C T GAG G T 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 

G T CAT G C AG AAT TAT G G G GAT C AC C C T T G T G AGC AAAAAG G C GAAC CAG C AG C T G AAT T T C AC AG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T TGCAGCCT AT TG T TACAACT CATC TGAT ACT T GGAC TAAC T CGT GCATT CCAGAA 
AT TATCACCACCAAAGATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATT TATTGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 

GAAACTAGCACCATGTCTACAGAAAC T GAACCAT T T GT T GAAAA.TAAAGCAGCAT T CAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
T GAGGAAT CAAAGAAAAC T GATAAAAACC CAGAAGAGTCCAAGAGTCCAAGCAAAAC TACCGT GC 
GAT GCC T GGAAGC T GAAG T T TAGAT GAGACAGAAAT GAGGAGAC ACACC TGAGGC T GGTTTCTTT 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCAGTGAGACCAAGGCT T TC TCTAC T GAT T CC GCAGCT CAGACCC T TT CT T CA 
GC TC T GAAAGAGAAACACG T AT CCCACCT GACATGT CCT T C TGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA 
GCTAAAAT AAAGAAAT AGAAC AAGGC T GAGGAT ACGACAGT ACAC TGT CAGCAGGGACT GT AAAC 
ACAGACAGGGTC AAAGTGT T TT C TC TGAACACAT TGAGTT GGAAT CAC T GT T TAGAACACACACA 
C T TAC T T T T T C T GGTC TCTAC CAC T GCT GATAT T TT C T CT AGGAAATATACT TT TACAAGTAACA 
AAAATAAAAACTC TTATAAATT TCTATT TT TATCTGAGT TACAGAAATGATTACTAAGGAAGAT T 
ACTCAGT AATT TGT T T AAAAAG T AAT AAAAT TCAACAZ^ACATT T GCT GAATAGC TACTATAT GT C 
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG 
AACGCT AT C T GGGAAGCT AT TT T TT TCAGT T T T GATAT T T CT AGC T TAT C TACT T CCAAACT AAT 
T T T TAT TT T T GC T G AG ACT AAT CT TAT T CAT T T TCTC T AATATGGCAACCAT TATAAC C T TAAT T 
TAT TAT T AACATACC T AAGAAGTACAT TGT TACC TC TATATACCAAAGCACAT T T TAAAAGT GCC 
AT T AAC AAAT G TAT CAC TAG CCCTCCTTTTTC C AAC AAG AAG GG AC T GAG AG AT GC AGAAAT AT T 
T GTGAC AAAAAAT TAAAGC AT T TAGAAAAC T T 



FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L S LAGKDQVETALKAS FSTC S YGWVGDG FWI SR I S PNPKCGKNGVGVL I WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENK^AFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKWKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV 

Signal sequence: 

amino acids 1-16 



Transmembrane domain : 

hi amino acids 235-254 



%l 



N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 



fjf Casein kinase II phosphorylation site. 

CI amino acids 145-149, 214-218 



Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 

GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 

CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 

CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 

GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 

GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 

AAGCAT CATCAGAAGT GAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 

AC G AAGG T T G G AAAT AAT AC CAT C CAT G T G C AC C GAG AAAT T C AC AAG AT AAC CAAC AAC C AG AC 

TGGACAAAT GGTCTT TT CAGAGACAGT TATCACAT CT GT GGGAGACGAAGAAGGCAGAAGGAGCC 

ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 

ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 

GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 

ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 

ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 

CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 

GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 

CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 

CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 

CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 

GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 

TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 

CAAC AAAT GAATT T T CCACGCAGT T C T T T CC AT GGGC AT AGG T AAGC T G T GCC T TCAGCTGT T GC 

AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 

AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 

CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 

CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 

TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 

CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 

ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 

TACAGGT TAACCTGCAGAAACAGTACTTAGGTAAT TGTAGGGCGAGGATTATAAATGAAAT TTGC 

AAAATCAC TT AGCAGC AAC TGAAGACAAT TAT CAAC CACGTGGAGAAAAT CAAACCGAGCAGGGC 

T GT GT GAAACAT GGT T GTAATATGCGAC T GCGAACACT GAACTC TACGCCACTCCACAAAT GAT G 

T T T T CAGGTG T CAT GGAC T GT TGCCACCAT GTAT TCAT CCAGAGT T C T TAAAGT T T AAAGT TGCA 

CATGATTGTATAAGCATGCTT TCTT TGAGT TT TAAAT TAT GTATAAACATAAGT TGCATTTAGAA 

ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAA?\AAAAAAAAA?iA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFAS FQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 

RS L TEEMALGE PAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-203 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 



111 327-331 



N-myristoylation site. 

amino acids 202-208, 217-223 

Ami da ti on site. 

amino acids 140-144 
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FIGURE Q 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAG T GC TGCC TC TGGT G TCT GGAAAAAT T TATCAAGT T CC TAAACCGCAATGCATACAT CAT GA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TCT GGGCAAGAAGAACGAGGCGCCC CCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGAT T T TAT TAAAGATAT TT TGT TAAC T C 
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FIGURE IP 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PT FPL I SAFI RTLRYHTGSLAFGAL I LTLVQI ARVT L 
EYIDHKLRGVQNPVARC IMCCFKCCLWCLEKFIKFLNRNAYIMIAI YGKNFCVSAKNAFMLLMRN 
I VRVWLDKVTDLLL FFGKLL WGGVGVLS FFFFSGRI PGLGKDFKS PHLNYYWLP IMT S I LGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features : 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites . 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 30 6-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCGCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
m CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
p GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
S! TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
W CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
I , TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
f;| TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
111 AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
fctf GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 

tn 

ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGA GGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GAC TTCGTGCCT T AC T GAGT CT C T AAGAC t TT T T CT AAT AAAC AAGC CAGT GCGT GT AAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLI LVGLT VGAFYI PDGS FTN I WFYFGWGS FLFI L I QLVLL I DFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTE PSGCHEGKVFI SLNLT FCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD 
FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



fin 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT T GGAAAAAGAC TCC T GT AACC C TCCTCCAGG ATGA ACCACC TGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCAT TGAGAACACAT TAGAGAAGGAGG TGAT GCAGTATGACT AC TAT T C TT CAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 



CAGACT CC T GAT AG T T CAGGATGC TTCAGAGAGGGCAGCACT TATACCT GG T GGTC T T T CT GAT G 
%l GTCAGT T T TAT TCC CCTCC T GAAT CC GAAGCAGGATCT GAAGAAGC TGAAGAAAAACAGGACAGT 

H GAGAAACCAC T T T TAGAACT ATGAGTAC TAC T T T TGTTAAAT GT GAAAAACCC TCACAGAAAG TC 

ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
% b CAC T GC T GGCT TTAT TGAACAGC TAATAAAGAT T TAT T TAT T G T AATACCT CACAAACGT T G TAC 

CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAGACTTAATATATT GAAGTAACACT T TT TTAG TAAGCAAGATACCT T T T TAT T TCAAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T T GC AAAAG G G GAAG AAAG GAAT T G C GAAT AC AT G T AAAAT G T CAC C AG AC AT T T G TAT TAT T T 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACT TAAACCATT CATATCATGT TTCC TT T GCGTTCAGCCAAT TTCAAT T AAAATGAA 
C TAAAT TAAAAA 
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FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI I ELNVNGG I ENT LEKE VMQYDY YS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLIAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AAL I PGGLSDGQFYS PPE SE AGSEEAEEKQDS EKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

O 

m Transmembrane domains: 

{si amino acids 54-72, 100-118, 130-144, 146-166 

f i! * '. N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 

is 

'CI 

PI • 

■WSJ? 

ill 
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FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGAGTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGGTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
GCGAGGAGGCCTGCGGGGAGGCCCATACACGCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T TAGGAACATGTT T TGC T T T T T TAAAATATATATAT T TATAAGAGAT CCT TTC CCAT TT AT TC T 
GGGAAGAT GT T T TT CAAACTCAGAGACAAGGAC T TTGGT T TT T G TAAGACAAAC GAT GAT AT GAA 
GGCC T T TT GT AAGAAAAAAT AAAAGATGAAGT GT GAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLIALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
L1AALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQS PLHAKPY I 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT G AAG AAG G CAG C CAG GAT GAAT C C T T AGAT T C C AAG AC T AC T T T GAC AT CAG AT GAG T C AG T A 
AAGGACCAT AC T AC TGCAGGCAGAGTAG T TGC TGG T CAAATATT TC T TGAT TCAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATAT CAGC TT T C T AGAGT CTCCAAAT CCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T T C CTAGATAAGGAGT AT GATGAATGT ACAT CAGATGGGAGGGAAGAT GGCAGAC TGTGGTGTG 
™* C TACAACC TAT GAC T ACAAAGCAGAT GAAAAG TGGGGCT T T TGTGAAACT GAAGAAGAGGC TGCT 

m 

. AAGAGACG GC AGATGCAGGAAGCAGAAATGATGTAT CAAACT GGAATGAAAATCCT TAAT GGAAG 
| R j CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
Sj CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 

M s GCAGCGAGAGAGATG T TT GAGAAGC T GAC T GAGGAAGGC T CT CCCAAGGGACAGAC TGCT CT T GG 

%l 

'* CTTTCTGTATGCCTCTGGACTTGGTGTTAAT.TCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 

m 

pi TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 

. g f 5 AATAATAT T AAC ATCAGAAGAAT T TGTGGTT TATAGC GGCCACAACT T T T T CAGC T T TCAT GAT C 



r-1 



CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 
AACACATGGAATCTACATGTAAATGAAAGT T GGTGGAGTCCACAAT T TTTCTTTAAAATGATTAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT T G T CAGAAT C AT T T T T T AC AT TAG AT TAT C AT AAT T T TAAAAAT T T T T C T T TAG T T T T T C A 
AAAT TTT GTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTT TGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AACT CAT TTT T AAT AAAAT TAT G T C T AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAPAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 

MRVRI GLT LLLCAVLLSLASAS S DEEGSQDE S LDSKTTLT S DE S VKDHT TAGRWAGQI FLDSEE SEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVYYT FGALGGNLI AHMVLVSRL 



Important features: 
Signal peptide: 

amino acids 1-21 

||l N-glycosylation sites. 

Ml amino acids 195-199, 217-221, 272-276 



Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 



Microbodies C-terminal targeting signal. 

amino acids 299-303 



Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 

AATTCAGATT T TAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCT TCT T 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAAGTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGAT TTCATGTAATCGCTGCCTGTCTGAC TGAATCAGGATCAACAGC T TTAAAGGCAGA 
AACC T CAGAGAGAC T T CG TAC TGTGC T TC TGGATGTGACCGACCCAGAGAATG TCAAGAGGACTG 
CCCAGTGGGTGAAGAACGAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTG GAC T CAT CAG T GT GAC AC T AAAT AT GC TTCCTTTGGT C AAGAAAG C T C AAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 
TCGCCATT TGGGAGCAGCT GTCTCCAGACATCAAACAACAATATGGAGAAGGT TACATTGAAAAA 
AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGC T GGC TAAT CCCAAGGCAGT G TGAC T CAGC T AACCAC AAATGTC TCC T CCAGGC TAT GA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTC T TAT C T AAAT AT T AAAAGAT AAGTC AACC CAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENWRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEVNLFGL ISVTLNMLPLVKKAQGRVINVS SVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LS P WECMDHALT S L FPKTHYAAGKDAK I FW I PL S HMPAALQD FLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGT GAAGCCAAGAACAGCAT TACT GAT TC CCAAAT GGAT GAT GT T GAAGT T GT TT AT AC AAT T GA 
CAT TCAGAAATATAT TCCATGC TAT CAGCT T T T T AGCT T T TATAAT TCT T CAGGCGAAGTAAAT G 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
jU ATCGACT GGAACAT TCCT T ATATAAACCT CAAAAAGGACT T T T TCACAGGGTACCT T TAG T GGT T 

CI GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
U T TT TAGCCGAGC AGTACAAACACACAGC T C T AAAT T T T T T GAAGAAGAT GGATCC TTAAAGGAGG 

,f I TACATAAGAT AAAT GAAATGTATGCT T CATTACAAGAGGAAT TAAAGAGTAT AT GCAAAAAAGT G 

%l GAAGACAG TGAACAAGCAGT AGATAAAC TAGTAAAGGAT GTAAACAGAT T AAAAC GAGAAATT GA 

jU GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAAT CTGACCT TAATGGTAGAACACACT GACAT TCCT GAAGCTAGT C CAGC TAG TACACCAC 
■ AAATC AT TAAGCATAAAGCCT TAGAC T TAGATGACAGAT GGCAATT CAAGAGATC TCGGT T GT TA 
GATACACAAGACAAACGAT CTAAAGCAAATAC T GGTAGT AGT AACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGT T T TGG TGAATAT TCACGGTCT C 
C TACATT T TGATCC T T TT AACC TT ACAAGGAGAT T T T T T TAT T TGGC T GATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTT TGCAGTAATACACAGATAACTC TTAGTGCATT TACT TCACAAAGTACTTT TTCAAACATCA 
GATGCT T T TAT T TCCAAACCT T T TT T TCACCT T TCAC T AAGT TGT T GAGGGGAAGGC T TACACAG 
ACACAT T CT T TAGAAT TGGAAAAGT GAGACCAGGCACAGT GGC TCACACCT GTAATCC CAGCAC T 
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTATTTTCAAAATATGGAAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CAT TATGGC TAGAG TT GC CAGATAAAAT GCT GGATAT CATGCAATAAAT T T GCAAAACATCAT C T ' 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 

MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PC YQL FS FYNS S GE VNE QALKK I L S NVKKNWGW YK FRRH S DQ IMT FRE RLLHKNL QEH FSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDWDNLTL 
MVEHTD I PEAS PAS T PQ 1 1 KHKALDLDDRWQ FKRS RLLDTQDKRS KANTGS SNQDKASKMS S PE T 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 
amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



G GCACAGC CGCGC GGC GGAG GGCAGAGT CAG C CGAGCCGAGT C CAG CC GGACGAG C GGAC CAGCGCAGGGCAGCC CAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGAT GCC CCC CT CCACGAGATCAAC GGT GAT CAT T TGAAGAT CTGTC CCCAGG GTT CTACCT GCT GCT CT 
H : CAAGAGAT GGAGGAGAAGTACAGCCT GCAAAGTAAAGAT GAT TTCAAAAGT GT GGT CAGCGAACAGT GCAAT CAT T T G 

Q CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AATGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAA.GATCTCTTCGTAGAGTTG 
||! AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
Ul TT CCG CCT GGT GAACT C CCAGTACCACTTT ACAGAT GAGTAT CTGGAATGT GT GAG CAAGTAT AC GGAGCAGCTGAAG 

'H CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
N TTAGC GGTTGCG GGAGAT GT C GT GAGCAAGGTCT C C GT GGTAAAC C CCACAGCCC AGT GTACCCAT GCCC T GT TGAAG 

%f ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
* TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 
Q GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
HI AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
fel TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
Bit AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
.f£f GTTTGCAACGAT GAGAGGAT GGCTGCAGGAAACG GCAAT GAGGAT GACT GTT GGAAT GGGAAAGGCAAAAG CAGGTAC 

I II CTGTTTGCAGT GACAGGAAAT GGAT TAGCCAACCAGGG CAACAACC CAGAGGT CCAGGT T GACAC CAGCAAACCAGAC 

ATACT GAT CCT TC GT CAAAT CAT GGCT CTT C GAGT GAT GACCAGCAAGAT GAAGAAT GCATACAAT GGGAACGACGTG 
GACTT CTT T GAT AT CAGT GAT GAAAGTAGT GGAGAAGGAAGT GGAAGT GG CT GT GAGT ATCAGCAGTGCC CTT CAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGC CTACCT CCT CACT GT CT T CT GCAT CTT GTT C CT GGTTATGCAGAGAGAGT GGAG ATAA TTCT CAAACT CT GAG 
AAAAAGT GTT CAT CAAAAAGTTAAAAG GCAC CAGT TAT CACTT TT CTACCATC CTAGT GACTT TGCTTTTTAAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACT GT GCATT GAGTT GGT T CC TGCT CCC CCAAACCAT GT TAAACGT GGC T AACAGT GTAG GTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCTGAAATATTAAATAGCTGTACAGAAGCAGGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAA.LLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNWLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKP FGDVPRKLKLQVTRAFVAART FAQGIAVAGDWSKVSVVNPTAQCTHALLKM I YCSHCRGL 
VT VKPC YN YCSN IMRGC LANQGDLD FEWNN F I DAMLMVAERLE G P FN I E S VMDP I DVK I S DAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPS3WCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPD I L I LRQ IMALRVMT S KMKNAYNGNDVDF FD I S DE S S GEGS GSGCE YQQC PS E FD YNAT DHAG 
KS ANEKADS AGVRP GAQAYLL TVFCIL FLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGC T T CT AGGAGAT GGCTCC AGGAAGGCGGCC AAGAAT GTGAG TGCAAAGAT TGGT TCC T GAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAG GAGCTCTG 
AGCGCCCACTCTTCCAAT TAAACAT TCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGT GTCATT TAACCT TAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATT TTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVLISSLLLLLPmMSlWSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features : 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 



f ?i l amino acids 27-33, 4 6-52 

•SSif 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATT^AAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
N CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
JgJ TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
|| GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
y GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 



GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACAT T CATGTACCACCTACTAGTC CC TC T C TC C CCAACC TC TGCCAGGGC 
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAA.CGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGAC TCAAAGAGGCAGAGGT TT TGTTCT CAAATATT T T T TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMA.VLTPKSNRKMSSKKRELFSQIKGL 
TGASGKVALLSLGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVVAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQP I YLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

UL amino acids 10-32 

'■'•i 

P s N-myristoylation sites. 

- amino acids 64-70, 78-84, 80-86, 91-97, 201-207 

.& 
.'pas 

m 



III 



rii 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCT ATGC TGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGTVAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGT TTATGAT TCT TATACTAATTTATACAAAGATATTAAGGCCCTGTTCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



k. s 
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FIGURE so 



MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 



amino acids 3-18 
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FIGURE 31 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGC TCGTCT TGAT TT T T GT TCTCAGAAAGAGAATAAAAT TGACAGTT GAGCTT TTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATT GGCC T CAT CTGGACTAGTGAAT T CATC CT T GC GT GC CAGCAAAT GAC TATAGC T GGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 



%A TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
is GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
pi CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 

AACCAGAATGCATAT AC T ACAAC T GC TAT TAATGGGACAGAT T TC T G TACATCAGCAAAAG AT GC 

r i 

^ ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
p TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
f|| AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 

CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGAT GGATCGTCAGAAAAGCCCT AC TT T AT GGATCAAGAAT T TC TGAGT T T CGTA 
AAAAGGAGCAACAAAT TAAACAATGCAAGGGCACAGCAGGACAAGCACT CATTAAGGAATGAGGA 
GGGAACAGAAC T CCAGGCCAT T G TGAGATAGATACCCATT TAGGTATCT GTACC TGGAAAACAT T 
TCC T TC TAAGAGCCAT TTACAGAATAGAAGAT GAGACCAC TAGAGAAAAGT TAGT GAAT x T T T T T 
T TAAAAGACC TAATAAACCCTAT TC T T CC TCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
I L I FFWVLWAVLLS LGTAGAAQVMEGGQVE YKPL SGI RYMWS YHL I GL I WT S E F ILACQQMT IA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSIMCFGD 
FI I FLGKVLWC FTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFE TVLDALFLC FA 
VDLE TNDG S SE KP Y FMDQE FL S FVKRSNKLNNARAQQDKHS LRNEE G TELQAI VR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 



APPJD=10063717 



Page 186 



FIGURE aa 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATT T T TT GAAAAAAAAAT T GCC T TC T T CAAACAAGGGT GT CAT T C T GAT AT T TATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
C GAAAAGTCCAGC TGCCCAAGCAGGT GAGACCAC AAAAGCC TAT CAGAGGCCACC TAT TCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
g , CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
5" GCCACAGGAGCCAGGAGAT GGATC TC TGGT CCAC TGCCACC TACACAAGCAGCC AAAACAGGCCC 

O AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
0 GGATGT CAGCC TGGGACT T GTT CCAAAAGAAGAAT T GAGCAC ACAGT C TT TGGAGCCAGTAT CCC 

|1| TGGGAGAT CCAAAC T G CAAAAT TGAC TTG T CGT T TT TAAT T GAT GGGAGCACCAGCAT T GGCAAA 

CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
'ft CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 
N ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAAT 
M. GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
%l GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

GAC T TGCGAGAGAGT C AGGAAT CAAC AT T T T C T T CAT CACCAT T GAAGG T G C TGC TGAAAAT GAG 
» % AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
W': GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
HI ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
Q. GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
iJ AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 

TGGAGT T T GGGT TCGACAAG TACAGCAGCAAGCC TGACAT CCTCAACGCCATCAAGAGGGT GGGC 
W TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
f|| GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GTTTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCAC 
AGCC TCGGAAC TGAAT TCAGAGCAGGCAGAGCACCAGCAAGT GC T GC T T T ACTAAC TGACGT GT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAAT GAGCCAAAAGGC TACAT CAT GT TGAGGG TGC T GGAGATT TTACAT 
T T TGACAAT TG T T T T CAAAATAAATGT T CGGAAT ACAGT GCAGCCC T TACGACAGGC T T AC GT AG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T TT GT CAT GACAAT GT AGGAAT TGC TGAATTAAAT GT T T AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAZ\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAJ\^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKA.GKI IDPEFIVKCPAG 

CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 

ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 

TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 

VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 

GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 

RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYW 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 

LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWS.GGTSTGAAINFALEQL 

FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 

VDE FDNLHQYVPRI I QNICTE FNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 3 5 



CCGAGC AC AGGAGATT GC CTGC GT TT AGGAGGT GGC TGCGTT GT GGGAAAAGCT AT C AAGGAAGAAAT TGC 
CAAACCATGTCTTTTT TT CTGT T T TC AGAGTAGT TC AC AACAGAT CT GAGTGTTTT AATT AAGCAT GGAAT 
AC AG AAAAC AAC AAAAAACTT AAGCTT T AATTT C AT CT GG AATT CC AC AGT TT T CT T AGC TCCCT GGAC CC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
C AG ACAAGACTT TC ACTT C AC ACT TCGAGAGC AT T C AAAC TGCT C TC ATC AAAATCCAT T TCT GGT C ATT C 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
T GGT GGGGAT AT GAGGTT CTT ACATTT T TCT T AT T AGGC C AAGAGGCT GAAAAG GAAG AC AAAATGTT GGC 
ATT GTC CTT AGAGGATGAAC AC CT TCT T T AT GGT GAC AT AAT CC GACAAG ATT TTT T AGACACAT AT AAT A 
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
AAGACAGACACT GATGTT T T C ATC AAT ACT GGC AAT TT AGTG AAGT AT CT TTT AAACCT AAACC AC TC AG A 
GAAGTT TT TC AC AGGT T AT CCT CT AAT T GAT AAT T ATT C CT AT AGAGGAT T TT ACCAAAAAACCC AT AT TT 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
\l TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
$ CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
|Jl ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 
$*! TGGC AGGT C ATG CT AAGG AAC ACC AC AT GC C ATTAT TAACT TCACAT T CT AC AAAAAGCC TAGAAGGACAG 

W GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 

AC AC TGAACT GAAACT CAT G AAAAACCC AG ACTGGAGACT GG AG GGTT AC ACT T GT GAT T TAT T AGTCAGG 
C CC T TC AAAG AT GAT ATGTGGAGGAATT AAAT AT AAAG GAAT TGGAGGTT T TTGCT AAAG AAATT AAT AGG 
ACC AAACAAT TT GGAC AT GT CATT CT GT AG ACT AGAAT T T CT TAAAAGGGTGTT ACTGAG TT AT AAGCTC A 
C TAGGCTGTAAAAAC AAAAC AATGTAGAGT TT T ATT T ATT GAAC AAT GTAGTCACT T G AAGGTTTT GT GT A 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAAC AAAATT TT AC CT GTTT TT GGT C AT TT AT AAAGT ACT TC AAG ATGT T GC AGT AT T TCAC AGTT AT T 
ATTATT T AAAAT TACT T C AACTTT GT GT TT TT AAAT GT T TT GACGAT T TC AAT ACAAG AT AAAAAGGAT AG 
TG AATC ATT C TT T ACAT G CAAACAT T T T CC AGT T AC TT AACT GATC AGTT T ATT AT T GAT AC AT C ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAGAAT TAAAGCAAG AAAAT C TG AAAA 
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FIGURE 36 
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MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYOTIERWWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N- glycosylate on sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCT T GC CAAGAGAGTAC ACAGTCAT TAAT GAAGCC TGC C C T GGAGCAGAGTGGAATATCATGTGTCGGGAG TGCTGT G 
AAT ATGAT C AGATT GAGTGCGTC TGCCCCGGAAAGAGGGAAGT C GTGGGTT ATACC ATC C C TTGC TGCAGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCC AGGT TC TGC GAGC C CCAAAGGGTC AGAT TT TGT T GGAAAGC TATCCCC TAAATGCTCAC T GT GAATGGAC CATT CATGC 
TAAACCTGGGT T TGT C ATC CAAC TAAGAT TTGTCAT GT TGAGTC TGGAGT T TGAC TACATGTGC C AGT ATGAC TATGT TGAG 
GTTCGTGATGGAGAC AACC GC GATGGCC AGATC AT CAAGC GTGTC TGTGGC AACGAGCGGC CAGC TCC TATC C AGAGCATAG 
GATCC TC AC TCC ACGT CCT CT TCCACTCCGATGGC T C CAAGAAT T TT GACGGTTTCC ATGCCAT T TATGAGGAGAT CAC AGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TAT AC TGGGCAGCGC TGTGAAAATCTCC T TGAAGAAAGAAAC TGC TCAGAC C CTGGGGGCC CAGT CAATGGGT AC CAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
T C T TAGTGGCAATGAGAAAAGAAC TTGC C AGC AGAAT GGAGAGTGGTCAGGGAAACAGCCCATC TGC ATAAAAGC CT GCCGA 
GAACCAAAGATT TCAGACC TGGTGAGAAGGAGAGT T C T TCC GATGCAGGTT CAGTCAAGGGAGAC ACC ATTAC ACCAGC TAT 
ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
C TGAGGAC TGGGAAGTGGAGT GGGCGGGCAC CATC C TGC ATCCC TATC TGC GGGAAAAT TGAG AACAT CAC T GCTCC AAAGA 
CCC AAGGGTTGC GCTGGCC GT GGCAGGCAGC CATC T ACAGGAGGACCAGCGGGGTGC AT GACGGCAGCC TACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
® t GTC AC CATGATCAAGAC AGC AGACCTGAAAGTTGT T T TGGGGAAATTC T AC C GGGATGATGACCGGGATGAGAAGACC ATC C 

^4 AGAGC CTACAGAT TTCTGC TATC ATTC T GCATCCC AAC TATGACCCC ATCC TGCT TGAT GC TGACATCGCC ATCCTGAAGC T 

H'l CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG 
| TCCCACAT C ACT GTGGC TGGC TGGAAT GT CC TGGC AGACGT GAGGAGCCC T GGCT TC AAGAAC GACACACT GC GC T C TGGGG 

PJ TGGTC AGTGT GGTGGAC TCGC TGC TGT GT GAGGAGC AGC ATGAGGACCATGGCATCC CAGT GAGT GTC ACT GATAACATGT T 

C| CTGTGCC AGC TGGGAACCC AC TGCCCC T TC T GATAT C TGC AC TGCAGAGAC AGGAGGCATC GCGGC TGTGTCC TTCCCGGGA 

H| CGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATAAAACATGCAGCCACAGGCTCTCCA 

CTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATTGAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
CTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAGAAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCA 
C TCCAC T GAC CT GGTGGTC TT CCC CAAC T TT CAGT TATAC GAATGCC ATCAGC TT GACC AGGGAAGATC TGGGCT TCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTAC AT GGC CACAGTAC AGTC TGGTC C T TT TCC T TCC C CAT C TC TTGT AC ACAT TT TAATAAAATAAGGGTT GGC TTCT 
G AAC T AC AAAAAAAAAATW^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 

hk 

Si 
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FIGURE ft8 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGS.WGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP IQS I GS S LHVL FHS DGSKNFDGFHAI YEE I TAC S S S PC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAI YRRTS GVHDGSLHKGAW FLVCS GAL WERT VWAAHCVT DLGKVTMI KTADL 
KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 2 60-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAA.TTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTT GTAGCT GACAGAAG GT G GCCAGGGAGAAT GCAGCACACT GCT CGGAGAATGAAG GC GCTT CT GT TGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCTCCC ACTACGGCCT GAC CAAAGATAGGAAGAGGCG C T CACAAGAT GGCT GTC CAGAC GGCTGT GC GAGC C 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCT GCCTACGT GT C CT CGG C AGAGGACGGGCAGC CAGCAAT CAGCC CAGT GGACT CT GGC CG GAGCAAC C GAACT A 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAA.GAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGA^ 
L ; ; s GCCTCTCTATTAGGCTGGTGGGAGGTAGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGG 
gj TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
Q ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
r|| GCAGGAACAAT GGAC AGGCC C C GGAT GCCT AC AGACCC C GAGATGACAGCTT T CAT GT GATT C T CAACAAAAGTAGC C 

|s| C CGAGGAGCAGCT TGGAATAAAACTGGT GC GC AAGGTGGAT GAGCCT GGGGTT TTCAT CT T CAAT GTGCT GGATGGC G 

SI GT GTGGCAT AT CGACAT GGT CAGCT T GAGGAGAAT GACCGT GT GT T AGCCATCAAT GGACATGAT CTT CGATATGGCA 

|«*iE GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
SI GGAGCCCT GACAT CTTTCAGGAAGCCGGCT GGAACAGCAAT GGCAGCT GGT CCCCAGGGC CAGGGGAGAGGAGCAACA 

;S r CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
Q GCATGACCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAGCCCGGAGGAG 
|l] ' T CATAAGCAGAGAT G GAAG AATAAAAACAG GT GACATT T T GT T GAATGTGGATGGGGT CGAACT GACAGAGGT CAGCC 

C;!| GGAGT GAGGCAGT GG CAT TAT TGAAAAGAACAT CAT CC T CGAT AGTAC TCAAAGCT TT GGAAGT CAAAGAGTAT GAGC 

H} CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAA.CCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
£| GGGTCATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACACAGCTGGAAGTC 
f|| T GGGCTTCTGCATT GTAGGAGGTTAT GAAGAATACAAT GGAAA.CAAACCTTT TTT CAT CAAAT CCATT GTT GAAG GAA 

CACCAGCATACAAT GATGGAAGAATTAGATGT GGT GATATTCTTCTTGCT GTCAAT GGTAGAAGT ACATCAGGAAT GA 
TACAT GCTTGCTT GG CAAGACTG CT GAAAGAACT TAAAGGAAGAA.TTACT CTAACTATTGTTT CTT GGCCT GGCACT T 
TTTTATAGAAT CAAT GAT GGGTCAGAGGAAAACAGAAAAA.T CACAAATAGGCT AAGAAGTT GAAACACTATAT TT AT C 
TT GTCAGT TTTTATAT TT AAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGT AT GAT CATCTAATGAAAGC CAGTT 
ACAC CT CAGAAAA.TAT GATT C CAAAAAAATTAAAACTACT AGT TT TTT TT CAGT GT GGAGGAT TT CTCATTACT CTAC 
AACATTGTTTATATTTTTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCTGATTTAAATT TAAAATTTGGTATAT GCT GAAGT CT GC CAAGGGTACATTATG GCCAT T TTT AATT TACAGCT 
AAAATATT TTTTAAAAT GCAT T G C T GAGAAAC GTTGCT T T CAT CAAACAAGAATAAATATT TT T C AGAAGT TAAA 
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FIGURE 40 



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I SLMTDEPGLDNPAYVSSAEDGQPAI S PVDSGRSNRTRARPFERS T IRSRS FKKINR 
ALSVLRRTKS GSAVANHADQGRENS ENT TAPEVFPRLYHL I PDGE ITS I KI NRVDPSES LS I RLV 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDS FHVI LNKSSPEEQLGI KLVRKVDEPGVFI FNVLDGGVAYRHG 
QLEENDRVIiAINGHDLRYGS PE S AAHL I QASERRVHLWSRQVRQRS PD I FQEAGWNSNGS WS PG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLOTDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
M a S DWS PS WVMWLEL PRCL YNCKD I VLRRNTAGSLGFC I VGGYEE YNGNKP FFI KS I VEGT PAYNDG 

, rircgdillavngrstsgmihaclarllkelkgritltivswpgtfl 

U 

m 

hi Important features: 

\$ Signal peptide: 

amino acids 1-15 

Hi 

f?| N-glycosylation sites. 

HI . amino acids 108-112, 157-161, 289-293, 384-388 



ru 



Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-rnyristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 

ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAA.TCAGATTGGAAAAGCTCAACTTGAAGCTTT 

CTTGCCTG C AGT G AAGC AG AG AG AT AG AT AT TAT T C ACGT AAT AAAAAACATGGG C T T C AAC C T GAC T 

TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 

CACCAGT AAC T AC T T C GTGGG T G C CAT T C AAGAGAT T CC T AAAG GAAAG GAGT T CAT GGCT AAT T T C C 

ATAAGACCCTCAT TTT GGGGAAGGGAAAAA.CT CTGACTAATGAAGCAT CCACGAAGAAGGTAGAACTT 

GAC AACT GT CCT T C T GT GTC T CC T T AC CT CAGAGGC C AGAGCAAGCT CAT T T T CAAACC AG AT CT CAC 

TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 

CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 

CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 

TAAAAAGT TTAAT CGAGCCAAACTCTTGAAT GTGGGCTAT CTAGAAGCCCTCAAGGAAGAAAATT GGG 

ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 

CAT C C CAAGCAT C TG G T GGT T GG C AGGAACAG CAC T GGGT ACAG GT T AC GT TACAG TGGAT AT T T T GG 

GGGT GTTACT GCCCTAAGCAGAGAGCAGT TTTTCAAGGTGAAT GGAT TC TCTAACAACTACT GGGGAT 

G GGGAGGC GAAG ACGAT GAC C TC AGAC T CAGG G T T GAGC T CCAAAGAAT GAAAAT TTCCCGGCCCCTG 

CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 

GAT GAAGC T C T T ACAC C AAG T GT C ACG AG T C T G GAG AAC AGAT GGGT T GAG T AGT T G T T C T T AT AAAT 

TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 

TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCT.TTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 

ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 

TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 

CATGAGGGTTAAATATT GTAATAT GGATACT TGAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 

AATGAACGCTATTTGAGGACTCT GGTTGAAGGAGATTTAT TTAAATTTGAAGTAATATAT TATGGGAT 

AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAAl 

AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 

GAGAAGGCGTCCAC^AAAGAGGGGAGAAAAGGCGACGAATC^GGACACAGTGAACTTGGGAATG^^ 

GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 

CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 

T TTTAAAGAGTT T TT GTAAAATGATTT TGTACAAGTAGGATAT GAAT TAGCAGTT TACAAGTTTACAT 

ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 



APP ID=10063717 



Page 195 of 322 



FIGURE 42 

MGFNLTFHLS YKFRLLLLLTLCLTWGWATSNYFVGAI QE I PKAKEFMANFHKTL ILGKGKTLTN 
EAS TKKVELDNCPSVSPYLRGQSKL I FKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNPLY INI T VDFW FGA 



m 

H 



m 



Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins . 

amino acids 191-202 
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FIGURE 43 



ft 



$1 



111 



GCT CAAGACCCAGCAG TGGGACAGC CAGACAGACGGCACGATGGCACT GAGCTCCCAGAT CT GGG 

CCGCTTGCCTC'CTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAA2VAAAAAAAAAAAAA 
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FIGURE 44 

MALSSQIWAA.CLLLLLLLASLTSGSVFPQQTGQIjAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
C I FCCGCCHRS KCGMCCKT 



m 



Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50 



Prokaryotic membrane lipoprotein lipid attachment site. 

r ^ amino acids 1-12 
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FIGURE as 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC T T CAACAC AAC CC C T C T TGTCACCAT ACAGCGAGAAGG GGGCAC TAT CAT AG T GACCC AAA 
AT CG T AAT AGGGAGAGAG T AGAC T T C C C AG ATGGAGG C T AC T C C C TGAAGC T C AG CAAAC T GAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACC T G T GT GACCAATCTGACAT GCTGCAT GGAACAT GGGGAAGAGGAT GT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
C TGGAGATGGGGAGAAAGT GATATGACCT TCAT C TGCG T T GCCAGGAACCC T GTCAGCAGAAAC T 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
G T T T C T G AAG AG AGAG AG AC AAGAAGAG TAG AT T GAAG AG AAG AAGAG AG T GGAC AT TTGTCGGG 
AAAC T CC TAACATATGCCCCCAT TCTGGAGAGAACACAGAGTACGACACAAT CCC TCACACT AAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT T TACTC CAC T GT GGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPWELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKM.GQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD I CRETPNIC PHSGENTE YDT I PHTNRT I LKEDPANTVYS TVE I PKKMENPHSLLTMPDT 
PRLFAYENVI 



Important features: 
Signal peptide: 

amino acids 1-22 



Vt^ Transmembrane domain: 

*i amino acids 224-250 

is 

%A Leucine zipper pattern. 

s amino acids 229-251 



X; N-glycosylation sites. 

r" "i 

fg, amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 

H 291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAAT TTTCAT TGAAAAACATCAGTGACATT CATCCAGAATCCT T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
Q CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
P CAGTCAGATAGTCATCGGTTTGCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
*fl T GTAGT T TAAT GGGAAT AAAATG TAAG TAT CAGTAGT T TGAAAAAAAAAAA 

iS, i 
■•ft a 

i ■ 

ci 

5 *i;r 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMA.SGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28/ 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE AQ 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAG T GACCCT GGCCC T TC TCCTAC TGGCAGGCC TGAC TGCC T T GGAAGCCAAT GACC 
CAT T T GC CAATAAAGACGAT CCC T TC TAG TAT GACT GGAAAAACC TGCAGC T GAGCGGACT GATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
C TACT T GC TGAGCACAGGAC TGGCC T CCAGGGATGGCCTGAAGCC TAACAC T GGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANDP FANKDDP FYYDWKNLQLS GLI CGGLLAI AG IAAVLS GKCKYKS 
S QKQHS PVPEKAI PL I T PGS AT TC 

Important features: 
Signal peptide: 

amino acids 1-16 



Transmembrane domain: 

amino acids 36-59 

ists 1 

fY N-myristoylation sites. 

01 amino acids 41-47, 45-51, 84-90 

hi 

5 'f Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

^ s amino acids 54-67 



rii 
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FIGURE 51 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 

TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACT CAGCAGGCAGC T T T GGAATGAATCCTCAGGGAGCT CCC T GGGGT CAAGGAGGCAAT GGA 

GGGC CACCAAAC T T TGGGACCAACAC T CAGGGAGCTGT GGCC CAGCC T GGC TATGG T TCAGTGAG 

AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 

ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 

AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 

T GGCGGCAGCAG T GG T GGCAGCAG TGGCAACAGT GGT GGCAGCAGAGG TGACAGC GGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAAT TTTAAATCCAAGCTGGGTT TCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCT CTC 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGS GESGI QGFRGQGVS SNMRE I SKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETS PGM 
FNFDTFWKNFKSKLGFINWDAINKDQRS SRI P 



Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

» 

|fj Casein kinase II phosphorylation site. 

O amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

W N-myristoylation site. 

^ amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE S3 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
? GAAGTCCTATATAACGATCT TCAACAAGAGTGCAAACATCATGC TTGACAAGTGGCAGCACCT GG 

CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
gj CAGAAATGCATC T TCAGC T T TGACAGCCAT T GT CAGGAGAGGCCCAG T GAATAT AT TGCCACCAT 

IP C T TGGAGC T CAGTGCC C T TGTAGAGAAAAGAAGCCAGCATAT CCT CCAGCACATGGAC TT T C T GT 

W ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
*f GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
I ; AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 

SI 

n CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
C| GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ill ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



APP_ID=10063717 



Page 207 of 322 



FIGURE 54 

MSLLSLPWLGLRPVi^SPWLLLLLWGSWLI^II^WTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
L I T PTEE GLKDS TQHSAT YS QG FT VWLGP 1 1 P FI VLCHPDT I RS I TNASAAI APKDNL FIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
IAFIPFSAGPRNCIGQAFAMAEMKWLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II) , 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 

ATCGCATCAATTGGGAGTACCATCTTCCTC ATG GGACCAGTGAfiACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTG.CAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCAT GGCCAGT TT TAT GAAGC T T T GGAAGGCACT AT GGACAGAAGCTGGT GGACAGT T T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGTVACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T CT GGAT GT TGTCCCACT GAAT T CCCATGAATACAAACCT AT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 



MGPWQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGIALIFCILQSIJ^LTWYSLSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 



M 

eg 

- - 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
AT TTT CAAGCT CAGT GT CTT CAT CCCCT C C C AGGAAT T CT C CACCTAC CGC CAGT GGAAGCAGAAAATTGT ACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGAT AT CT GAACAGCAGGCAGAAAAAAT T CT CAAGAGCATGGATAAAAACGGCACGAT GACCAT C GACTGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
AT CT TT GATGT G GGT GAGAAT CTAAC GGT CCC GGAT GAGT TCACAGT GGAGGAGAGGCAGACGGGGAT GT GGT GGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
C AG GTCCATGCCT CC CGCAGCAACAACAT GGGCAT C GTT GGT GGCTT C ACT CAGAT GATT C GAGAAGGAGG GGCCAGG 
TCACT CT GGC GGGGCAAT GGCAT CAACGT CCTCAAAAT T GCC C CCGAAT CAGC CAT CAAAT T CAT G GC CTATGAG C AG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
s : . ATC GC CCAGAGC AGCAT CTACCCAAT GGAGGT C CT GAAGAC CC GGAT GG C GCT GC GGAAGACAGG C CAGT AC T CAGGA 

5! ! ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
Q GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
JSfj. AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 

CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
|fl TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
j . I G CT GT GAGCAT C AGCT ACGT GGT CTAC GAGAACCT GAAGAT CACC CTG GGC GT GC AGT C GCGGTGACGG GGGGAGGGC 

J** C GC CC GGCAGT GGACT CGCT GAT C CT GGGCC GCAG C CT GG GGT GT GCAGCC AT CT CAT T CT GT GAAT GT GCCAACACT 

%i AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
l^i. GTCCT GCT GAC CC CAGCAGACCCT CCT GTT GGT T C CAGC GAAGACC ACAGGCAT T CCTTAGGGT C CAG GGTCAGCAGG 

l'\ CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
%l TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 

CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
^, ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
Q TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
| ff CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
JJ; GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
|.*J CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ffs ACT GTT GGGAAAAGG GTT TT GT CCAGAAGGAC AAGC CG GACAAAT GAGCGACTT C T GT GCTTCCAGAGGAAGAC GAGG 

GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
ls| GGGACCAGCC CCACAT TCCACTT GT GTCAC T GCTT GGAACCTAT TTAT T TT GTAT TTAT TT GAAC AGAGTTAT GT C CT 

I % AACTATTTTTATAGAT TT GT T TAATTAAT AGCTT GT CATTTTCAAGTT CATTTTT TATT C ATATTTAT GTT CAT GGTT 



GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTT GGGGAAGGGCT T GCCC C CAGCCTTAG GATT T CAGG GTTT GACT GGGGGCGT GGAGAGAGAGG GAGGAACCT CAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTTCAAT AG GAT GCAAAGAT CAAT GCAAAAATTGTT ATATAT GAAC ATATAACT GGAGT C GT CAAAAAG 
CAAATTAAGAAAGAAT T G GACGT TAGAAGT T GT CATTTAAAGCAGC CTT CTAATAAAGT T GTTT CAAAG CT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRD YHLLHPVENI PE 1 1 LYWKHST I FDVGENLT VPDE FT VEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYE Q I KRLVGS DQE T LR I HERL VAGS LAGAI AQS S I YPMEVLKT RMALRKTGQYS GMLDCARR 
ILAREGVAAFYKGYVPNMLGI I PYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALVRTRMQAQAS I EGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains : 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE *Q 

GGAAGGCAGCGGCAGCTCCACTCAGGCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGT GATACAATGGC T GAAGGAAGGT GT T T TAGGCTTGGTCCAT GAGT TCAAAGAAGGCAAAGATG 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT C AC T T C T AAAG GC AAG G GG AAT GC T AAC C T T GAG T AT AAAAC T G GAG C C T T C AGC AT G C C GG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGC T TT GAGC T GAAC T C T GAGAAT GT GACCAT GAAGG T TGTGTCTGTGCT CTACAAT GT T ACGA 
T CAACAACACATAC TCCT G TAT GAT T GAAAATGACAT T GCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCT GCTAAAC TCAAAGGCT TC TCT GT GT GT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
G TGCC T T GGCC AC AAAAAAGCATGCAAAG T CAT T G T T ACAACAGGGAT C T AC AGAAC TAT T T C AC 
C AC CAGAT AT GAC C T AGT T T TAT AT T T CT GGGAGGAAAT GAAT T CAT AT CT AGAAGT CT GG AGT G 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATCT TCAAAGACAT AT TAGAAGT T GGGAAAATAAT T CAT GTGAACTAGACAAGTGTG TT AAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACC TGGGGAGT GAGAGGACAGGATAGTGCATGT TCT T TGT CTC TGAATT T T T AGT TATATG TGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT T CC CAAC TGACAAAT GC CAAAGT T GAGAAAAAT GAT CAT AAT T T T AGC ATAAACAGAGC AGT 
C G G G GAC AC C G A T T T T AT AAAT AAAC T G AG C AC CTTCTTTTT AAAC AAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 60 

MASLGQILFWSIISIIIILAGAIALriGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTS FELNS ENVTMKVVS VL YNVT INNT YS CMI END I AKAT GD I KVTE SE I KRRS HLQLLNS KAS L 
CVSSFFAISWALLPLSPYIjMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N -myr i s toyl a ti on si tes . 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT T GG T C T T CAT T C AAT GAT GAG AC C T G C C T CAT GAT G AT AAAC AT G T T T G AC AAGAC C AAG T C A 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
p| CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
MJ AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

u 

, , GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACAXCCGGCTCAGCTTCGAGGACTTCGTCA 

"•si 

CCATGACAGCT TC TCGGATGCTATGACCCAACCATC T GTGGAGAGTGGAGT GCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCT GAGGCCACACAGATAGGGGCCT GATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 



0 



11 AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTA.ee 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
AT TT GGGGC CAAAAGT CCAG TGAAAT T GTAAGC T TCAATAAAAGGATGAAAC TC TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALVNCNWSSFNDETCLM^ 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCT GGGG TAT CAATAAT CAAC CCCCAGGT ACCAGCT GGGGAAATAT T AAT C GGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
||1 GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
E? TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
JjJ T GC T CAT CT CCAATAAAAT AAAAGCAC TT AT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

2'i AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 6d 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 

DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPW 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 

PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 

WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 
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few 

01 



ft 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATCGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGG 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites . 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

ACGGACCGAGGGT TCGAGGGAGGGACACGGACCAGGAACCTGAGC TAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
S GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
f! GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAA.TAGCAGGGAAGAAGGTACTTCAA 
W AGAC TC TGCCCC TGAGGT CAAGAGAGGATGGGGC TAT T CAC T T T TAT AT AT T TATATAAAATT AG 

TAGT GAGAT GTAAAAAAAAAAAAAAAAAA 

H 

m 

•Fi; 3 
Ml 
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FIGURE 68 

MANPGLGLLLALGL PFLLARWGRAWGQ I QT TSANENSTVLPSS T S S S SDGNLRPEAI TAI I WFS 
LIAALLIJWGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 8 6-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6Q 



GC.C AG GAATAACT AGAGAGGAACAATGGGGTTATT CAGAG GT TTT GTT TT C CT CTT AGTT CT GT GCCT GCT GCACCAG 
TCAAATACTT CCTTCATTAAGCTGAATAATAATGGCTTTGAAGATATT GT CATTGTTATAGAT CCTAGTGTGCCAGAA 
GAT GAAAAAATAATTGAACAAATAGAGGATAT GGTGACTACAGCTTCTACGTACCT GTTTGAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAAT GTATCTATATTAATTCCT GAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATGAA 
AAC CAT AAACAT G CT GAT GTTAT AGTT GCACC ACCTACACT C CCAGGTAGAGATGAAC CATACAC CAAGCAGT T CACA 
GAATGT GGAGAGAAAGGCGAATACATT CACTTCACCCCTGACCTT CTACTTGGAAAAAAACAAAATGAATATGGACCA 
C CAGG CAAACT GT TT GT CCAT GAGT GGGCT CACCT C CGGT GGGGAGT GT TT GAT GAGTACAAT GAAGAT CAGC CT TT C 
TACCGTGCTAAGT CAAAAAAAATCGAAGCAACAAGGTGTT CCGCAGGTAT CTCTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACT GTATGGAAAAGATT GTCAATTCTTT 
C CT GAT AAAGTAC AAACAGAAAAAGCAT C CAT AAT GT T TAT GCAAAGTATT GAT T CTGT T GTT GAATTT T GT AACGAA 
AAAACCCATAATCAAGAAGCTCCAAGCCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAAT 
T CT GAGGAT T TTAAAAACAC CATACC CAT GGTGACACCACCTC CTCCAC CT GT CTT CT CATTGCT GAAGAT CAGT CAA 
AGAATTGTGTGCTTAGTTCTTGATAAGTCT GGAAGCATGGGGGGTAAGGACCGCCTAAAT CGAAT GAATCAAGCAGCA 
AAACAT TT CCT GCT GCAGACT GT T GAAAAT GGAT CCTGGGT GGGGAT GGTT CACTT TGATAGT ACT GCCACT ATT GT A 
AAT AAGCTAAT CCAAATAAAAAGC AGT GAT GAAAGAAACAC ACT CATGGCAGGAT T ACCT ACATAT CCT CT GG GAGGA 
ACTT C CAT CT GCT CT GGAATTAAATATGCAT TT CAGGT GATT GGAGAGCT ACAT T CCCAACT C GAT GGAT CC GAAGTA 
CTGCT GCTGACTGAT GGGGAGGATAACACTGCAAGTTCTT GTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
TTTATT GCT T T GGGAAGAGC T GCT GAT GAAGCAGT AATAGAGAT GAGCAAGAT AACAGGAGGAAGT CATTT TT AT GTT 
T CAGAT GAAGCTC AGAACAAT GGC CT CAT T GAT GCTTT T GGGGCT CTTACATCAGGAAATACT GAT CT CT C CCAGAAG 
T CCCTT CAGCT CGAAAGTAAGGGATT AACACTGAATAGTAAT GC CT G GAT GAACGACAC T GT CAT AAT T GATAGTACA 
GT GGGAAAGGACACGT T C T TT CT CAT CACAT GGAACAGT CT GC CTCCCAGTATT TCT CT CT GGGAT C C CAGT GGAACA 
ATAATGGAAAATTTCACAGTGGAT GCAACTTCCAAAAT GGCCTATCTCAGTATTCCAGGAACT GCAAAGGT GGGCACT 
T GGGCATACAAT CT TCAAGCCAAAGCGAACC CAGAAACAT TAACTATTACAGTAACTT CTC GAGCAGCAAATT CTT CT 
GT GCCT CCAAT CACAGT GAAT GCTAAAAT GAAT AAGGACGT AAACAGTT T CCC CAGCCCAAT GATT GT TTAC GC AGAA 
ATT CTACAAGGATAT GT ACCT GTT CT T G GAGC CAAT GT GACT GCTT T CATT GAATCACAGAAT G GACATACAGAAGTT 
TTGGAACTT TT GGATAAT G GT G CAGGC GCTGAT T CTT T CAAGAATGAT GGAGT CTACT C CAGGTATTT TACAGCAT AT 
ACAGAAAAT GG CAGAT ATAGCTTAAAAGTTC GGGCT CAT GGAG GAGCAAACACT GCCAGGCTAAAATTACGGCCT CCA 
CT GAATAGAGCC GC GT ACAT ACCAGGCT G GGT AGT GAAC GGGGAAAT T GAAGCAAACC CGCCAAGAC CT GAAAT TGAT 
GAGGAT ACT CAGACCAC CT T GGAGGATT T CAGCC GAACAGCAT C CGGAGGT GCATTT GT GGT AT CACAAGTC CCAAGC 
CTTCC.CTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
ACAT GGAC AGCAC CAGGAGAT AAT TT T GAT GTT GGAAAAGT T C AACGTTATAT CATAAGAAT AAGT GCAAGT ATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
AGCTTT GCATTTAAACCAGAAAATAT CT C AGAAGAAAAT GC AAC CCAC AT AT TTATTGCCATTAAAAGT ATAGAT AAA 
AGCAATT T GACAT CAAAAGTAT CCAACAT T GCAC AAGT AACT T T GTTT AT C CCT CAAGCAAAT CCT GAT GACAT TGAT 
CCTACAC CTACT CC TACT CCTACT C CTACT CCT GATAAAAGT CATAATT CT GGAGT TAAT AT TT CTACGCT GGTATT G 
T CT GT GATT GGGT CT GTT GT AAT T GT T AACT TTATT T TAAGT ACCACCATT TJ3AACCTTAAC GAAGAAAAAAAT CTT C 
AAGT AGACCTAGAAGAGAGTT TT AAAAAACAAAACAAT GTAAGT AAAGGATATT T CT GAAT CTT AAAATT CAT CCC AT 
GT GT GAT CATAAACT CATAAAAATAATTT TAAGAT GT CGGAAAAG GAT ACTT TGAT TAAAT AAAAACACT CAT G GATA 
TGTAAAAACT GTCAAGATTAAAATTTAATAGTTT CATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
AT CCT TT TT CAT ACT GAT AC CT GGT T GTATATTAT TT GAT GCAACAGTT TT C TGAAAT GAT ATT T CAAAT T GCAT CAA 
GAAATTAAAATCAT CTATCT GAGTAGTCAAAATACAAGTAAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQV I GE LH S QLDG SE VLL LT DGE DNT AS SCI DE VKQS GAI VH F I ALGRAADE AV I EMS KIT GG 
S HFYVS DEAQNNGL I DAFGALT S GNTDLS QKSLQLE SKGLTLNSNAWMNDTVI I DS TVGKDT FFL 
I TWNSLPPS I S LWDP SGT IMENFTVDAT SKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTS R 
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVWGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKI ILTWTAPGDN 
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains : 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



FIGURE 71 

CT CCT TAGGT GGAAACC CT GGGAGT AGAGT ACT GACAGCAAAGACC GGGAAAGAC CAT AC GT CCCCGGGCAGGGGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCTCAGAAGTGACTTGCCT GAGGGTGGACCAGAAGAAAGGAAAGGT CCCCTCTTGCTGTTGGCTGCACAT CAGGAA 
GGCTGTGATGGGAAT GAAGGTGAAAACTTGGAGATTTCACTTCAGT CATTGCTTCT GCCT GCAAGATCATCCTTTAAA 
AGT AGAGAAGCT GCT CT GT GT GGTG GTTAACT CCAAGAGGC AGAACT C GT T CTAGAAGGAAAT GGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACT GC CCAG GGCCAACAGC C C CAC GGGGAAGGAG GG GT ACCAGGC C GT C C T TCAGGAGT GG GAGGAGCAGCAC 
C GC AACT ACGT GAGCAGCCT GAAGC GGCAGATC GCACAGCT CAAG GAGGAGCT GCAGGAGAGGAGT GAG CAGCT CAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 

CTGGC CTT CCT GCACTCGCAGGT GGACAAGGCAGAGGT GAATGC T GGC GT CAAGCT GGCCAC AGAGT AT GCAGCAGTG 
CCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCGAGGAGAAGCCTGTG 
AGGAAGGACAAGC GG GAT GAGTT GGT GGAAGC CATT GAAT C AGCCTT GGAGAC CCT GAACAAT CCT G CAGAGAACAGC 
CCCAAT CACC GT CCTT ACACGGC CT CTGAT T T CATAGAAGGGAT CT ACCGAACAGAAAG GGACAAAGGGACATT GT AT 
GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 
GT GAAAAAT GAAAAGCT CAACATGG C CAACACGCT T AT CAATGT T ATC GT GCCT CTAG CAAAAAG GGT G GACAAGTTC 
C GGCAGTT CAT GCAGAAT T T CAGGGAGAT GT GCAT T GAGCAGGAT GGGAGAGT C CATCTCACT GTT GTTT ACTT T GGG 
AAAGAAGAAAT AAAT GAAGT CAAAGGAATACTT GAAAACACTT CCAAAGCT GC CAACT T C AGGAACTT TAC CT T CATC 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTTT CTGT GAT GT GGACAT CT ACT T CACAT CT GAATT CCT CAAT ACGT GTAGGCT GAATACACAGCCAGGGAAGAAG 
GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 
CAGCAGCT GGTCATAAAGAAGGAAACTGGATTTT GGAGAGACTT TG GATTT GGGAT GAC GT GTCAGTAT CGGTCAGAC 
TT CAT CAAT ATAGGT GGGTT T GAT CT GGACAT CAAAGGCTGG GGCGGAGAGGAT GTGCAC CTT TAT CGC AAGT ATCT C 
CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
CT GACCCCC GAGCAGT ACAAGAT GTGCAT GC AGT CCAAGGC CAT GAAC GAGGCAT CCCACG GCCAGCT GGGC AT GCT G 
GTGTTCAG G CAC GAGATAGAG GCT CAC CTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATCAACT C C CAGA 
GAAGGATT GTGGGAGAC ACTT TTT CT T T C CTTTT GCAATT ACT GAAAGT GGCT GCAACAGAGAAAAGACTT C CAT AAA 
GGAC GACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGCCTCCGATTT CT CT CT GTT GG GCT TT TT ACAACAGA 
AATCAAAAT CT CC GCT TT GCCT GCAAAAGT AACC CAGTT GCAC CCT GT GAAGT GTC T GACAAAGGCAGAAT GCTTGT G 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATT CAT GATT T AAGAG CAGT T TT GT AAAAAATT CATTAGCATGAAAGGCAAGCATATTTCT CCT CATAT GAATGA 
GC CTAT CAGCAGGGCT CT AGT TT CTAGGAAT GCTAAAATAT CAGAAGGCAGGAGAGGAGATAGGCTTATTAT GAT ACT 
AGT GAGT ACAT T AAGT AAAATAAAAT GGAC CAGAAAAGAAAAGAAACCATAAAT AT CGT GT C AT AT TTT C CCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTTATAGT CTGCTTATTTAATTAC CACTTTGCAAGC CTTACAAGAGAGCACAAGTT GGC CTAC 
AT TTTT ATATT TTT TAAGAAGAT ACT TT GAGAT GCATTATGAGAACT TT CAGTT CAAAGCAT CAAATT GAT GCCATAT 
C CAAGGACAT GC CAAAT GCT GAT T CT GTC AGG CACT GAAT GT CAGGCAT TGAGAC ATAGGGAAGGAAT GGT T T GTACT 
AATACAGACGTAC AGAT ACTT T CT CT GAAGAGTATTT T C GAAGAGGAGCAACT GAACACT GGAG GAAAAGAAAATGAC 
ACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAGAAACCACATTTT 
CT C CTCAGAAGT AGGGACCGCTT T CTTACCT GT TTAAATAAAC CAAAGT AT ACCGT GT GAAC CAAACAAT CT CTTTT C 
AAAACAGGGT GCT CCT CCT GGCTTCT GGCTT CCATAAGAAGAAATGGAGAAAAATATAT ATATAT AT AT AT AT ATTGT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATTATTTTTT AAAT TAAGCAGTT CT ACT CAAT CAC CAAGAT GCTT CT GAAAATT GCATT TTATT ACCATTT 
C AAACTATTTTTTAAAAATAAAT ACAGTTAACATAGAGT GGT TT CTT CATT CAT GT GAAAATTAT T AGCC AG CACCAG 
AT GCAT GAG CTAAT TAT CT CT T T GAGT CCTT GCT T CT GTT T GCT CACAGTAAACT CATT GTT TAAAAGCTT CAAGAAC 
AT T CAAGCT GT T GGT GT GTTAAAAAAT GCATT GT ATT GATTT GTACT GGT AGTT TATGAAATTTAATTAAAACAC AGG 
C CAT GAATGGAAGGT G GTATT GCACAGCTAATAAAATAT GATTT GT GGAT AT GAA 



FIGURE 72 



MMMWRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRP YTAS DF I E G I YRTERDKGT L YE LT FKGDHKHE FKRL I L FRP FS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
T FI QLNGE FSRGKGLDVGARFWKGSNVLLFFCDVD I YFT SE FLNTCRLNTQPGKKVFYPVL FSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 

KYLHSNLIVWTPWGLFHLWHEKRCMDEM^ 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142/ 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCC AT GATACCCT AC T GAAC ACCGAAT CCCC TGGAAGCC CACAGAG AC AGAGAC AGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
& ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

2 GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 

f J * 

M T GAAGCCACAT T TGCAGAGCTCCACAT T G TACAT TAT GAC TCTGAT T CCTATGACAGCT T GAGT G 

AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
H AATATAGC T TAT GAACACAT TC TGAGT CAC TTGCAT GAAGT CAGGCAT AAAGATCAGAAGACCTC 

N AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
fjfi AT T T CAAT GGAACAGC TGGAAAAGC T TCAGGGGACAT TGT T CT CCACAGAAGAGGAGCCCTC TAA 

p GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
111 TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
& TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

mi 



CCGAAAGAGT G T GGTC TT CACC T CAGCACAAGCCACGAC T GAGGCATAAAT TCC T TCT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGAT GT CC TCC TTCCCC TGGACAT CT CT TAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
AT GAGGAAATCGCTGTGTTGT TAATGCAGAGANCAAACT CTGT TTAGT TGCAGGGGAAGT T TGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCC T TAGGATAAAGAGT T GC TGTTGAAG T T GTATAT T T T T GAT CAATATAT T TGGAAATTAAAG 
TTTCTGACTTT 
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FIGURE 74 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QN YRALQPLNQRMVFAS FI QAGS S YTTGEMLS LGVG I LVGCLCLLLAVY FI ARKIRKKRLENRKS 
WFTSAQAT TEA 

Important features of the protein: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, ■ 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
T CTGGTGGT TTGCC T AAACCT GCAAACAT CACCTTC T TATCCAT CAACATGAAGAATGTCC TACA 
ATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTTCATCACAA 
AT T GGCCCACCAGAGGTGGCAG TGAC TACAGAT GAGAAGT CCAT T TC T GT TGTCC TGACAGC TC C 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCGAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
H AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 

M CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 

S!3 

tctgtgatgggctattccatctaccgatatatccacgttggcaaagagaaacacccagcaaattt 

t .. s gattttgatttatggaaatgaatttgacaaaagattctttgtgcctgctgaaaaaatcgtgatta 

\! actttatcaccctcaatatctcggatgattctaaaatttctcatcaggatatgagtttactggga 

M aaaagcagtgatgtatccagccttaatgatcctcagcccagcgggaacctgaggccccctcagga 

^ ggaagaggaggtgaaacatttagggtatgcttcgcatttgatggaaattttttgtgactctgaag 

5^ aaaacacggaaggtacttctctcacccagcaagagtccctcagcagaacaatacccccggataaa 

Iff acagtcattgaatatgaatatgatgtcagt^ccactgacatttgtgcggggcctgaagagcagga 

g| gctcagtttgcaggaggaggtgtccacacaaggaacattattggagtcgcaggcagcgttggcag 

if tcttgggcccgcaaacgttacagtactcatacacccctcagctccaagacttagaccccctggcg 

If caggagcacacagactcggaggaggggccggaggaagagccatcgacgaccctggtcgactggga 

hi 

tccccaaactggcaggctgtgtattccttcgctgtccagcttcgaccaggattcagagggctgcg 
agccttctgagggggatgggctcggagaggagggtcttctatctagactctatgaggagccggct 
ccagacaggccacCaggagaaaatgaaacctatctcatgcaattcatggaggaatgggggttata 
tgtgcagatggaaaac tga tgccaacacttccttttgccttttgtttcctgtgcaaacaagtgag 
tcacccctttgatcccagccataaagtacctgggatgaaagaagttttttccagtttgtcagtgt 
ctgtgagaattacttatttcttttctctattctcatagcacgtgtgtgattggttcatgcatgta 
ggtctcttaacaatgatggtgggcctctggagtccaggggctggccggttgttctatgcagagaa 
agcagtcaataaatgtttgccagactgggtgcagaatttattcaggtgggtgt 
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FIGURE 76 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYOTSVLNTKSNRTWSQCVTOTTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FW YVLP I S I TVFLFS VMGY S I YRY IHVGKEKHPANL I L I YGNE FDKRFFVPAEK 
I VI NFI TLNI SDDS KI SHQDMS LLGKSS DVS SLNDPQPSGNLRPPQEEEEVKHLGYASHLME I FC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCGCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGAT CAAT GAAAAGGCT GC AGAT AAGCTGGGAT C TACCCAGAT CGT GAAG ATCCTAAC TCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T T T AC AC C AAAGGT GACC AAC T TAT AC T CAAC T T GAAT AACAT C AGC T C T GATC GGAT C CAGC T G 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCA GTGA AGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWT FTLLCGLLAATL IQATLSPTAVLI LGPKV IKEKLTQELKDHNATS I LQQLPLLSAMREK 
PAGGIPVLGSLVNTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE 
FHMTTEAQAT IRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLS FLVNALAKQVMNLLVPSLPNL 
VKNQLCPVIEAS FNGMYADLLQLVfCVP I SLS I DRLE FDLLYPAIKGDT I QLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNN I S S DR IQLMNSG I GW FQPDVLKNI I TE 1 1 HS I LLPNQNGKLRS GVPVS L VKALG 
FE AAE S S LTKDAL VL T PAS L WKP S S P VS Q 



ft 

S Important features of the protein: 



Signal peptide: 

%l amino acids 1-21 

* ! " N-glycosylation sites. 

m amino acids 48-51, 264-267, 401-404 

C| Glycosaminoglycan attachment site. 

VM amino acids 412-415 ■ ■ 



LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGAGC 
TTCAGGC T GAAGACAAGGGAGCAGT CCCT GAAGACGCT T C TACTGAGAGGT CT GCCATGGCC T C T 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGT C T GT GAAAAACAG TGGACAGCAC CCC GAGGGCCACAGGTGAGGGACACT ACCAC T GGAT CGT 
GTCAGAAGGT GC TGC T GAGGAT AG AC T GAC T T T GGCCAT T GGAT T GAG CAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGGTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCAGAGTGTCCA 
GACTAAT T T GT GCAT GAACTGAAATAAAAC CATCCTACGGTAT C CAGGGAACAGAAAGC AGGAT G 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGAS IVTAVGFSKGLWMECATHSTGI TQCD 
I YSTLLGLPADIQAAQAMMVTS SAI S SLACI I SWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFI PVAWNLHG ILRDFYS PLVPDSMKFE I GEALYLGI I S SLFSLI AGI I LC FSCS S QRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide : 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site, 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 

CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 

GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 

GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 

ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 

CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 

CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 

ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 

GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 

AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGGCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 

ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 

AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 

C AG AC CC AAGGGAGC AGGC AGC ACAACT GT CAAGATCGT CC T GAAGGAG AAACAT AAGAAAGC C T 

GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 

CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 

CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 

C T CGGACT T GGT GGAGATCTACCTCT GGAAGCT GGTAAAAGATGAGGAAAC T GAGGCTCAGAGAG 

G TGAAGT AC CTGGCCCAAGGCCACACAGCCAGAATC T T CCAC T T GAC T C AGATCAAGAAAGTCAG 

GAAGC AAGACT T C CAGAAAGAGGCAC AGCAC T T CCGAC T GC TC GC T GGC CCC CACGAAGGT C AC T 

GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 

ACATAACAAAGACC TAACAGT T GCAG ATAT GAGC TGT ATAATT G T T GT TAT T ATATAT T AAT AAA 

T AAGAAGT TGCAT TACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVS CYRLHC PP VHC PQPVTEPQQCCPKCVE PHT P S GLRAPPKS CQHNG TMYQHGE I FS AHEL F 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 



in 



M 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGT T TC C ATGCC GTGAGGTCC AT T C ACAGAAC AC AT CC ATG GCT CTCATGCTCAGT T T GGTT CT GAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTT CT CT AG CGT GGT CC ACCT CT AC AGGG AC GGGAAGGACC AGCC AT TT ATGC AG AT GCCAC AGT AT C 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 

a , GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 

H* 

AC AGGT GT CAGC AC TGGGCT C AGT TC CT CT C AT TT C C AT C ACGGGAT AT GTT GAT AGAGACATCC AGCT AC 

f s% TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

fen J 

HI AC AGACTCC AGGAC AAAC AG AGAC AT GC AT GGCCT GTT TGAT GT GG AG AT C TCT CT GACCGTCC AAGAGAA 

|d CGCCGGGAGC AT AT CCT GTT CC AT GC GGC ATGCT CATCT G AGCC GAGAGGT GG AAT CCAGGGTAC AG ATAG 

SI GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 

M ? TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 

'"'I GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 

* AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 

Cl 

?|J CCTC AC TCT GAG AAGAGATT T ACAAGGAAG AGT GT GGT GGCT TC TC AG AG TTTC CAAGCAGGGAAACATT A 

^?r 5 CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
S| AGGAGT ACGT GACT T T GT CT CCCGAT C ATGGGT ACT GGGT CC TC AGACTGAATG GAGAAC AT TT GT AT TT C 

ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
pj TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CC AGTC ACCC AG GAAT C AGAGAAAGAGGCC TCTT GGCAAAGGGC C TCT GC AATCCC AGAGAC AAGC AAC AG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTT CTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTT CCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CTGACATTACATTTAGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAG 
AACCGT C AGGAAT T CC CATCTC AC AGGCT GTGGT GT AG ATTAAGT AGAC AAGG AAT GTG AAT AATGCT T AG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 
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FIGURE 8d 

MALMLS LVLS LLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLS PKTNAEAMEVRFFRGQFS SWH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRI S SQS YYQKAI WELQ 
VSALGSVPLI S I TGYVDRDI QLLCQSSGWFPRPTAKWKGPQGQDLS TDSRTNRDMHGLFDVE I S L 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I VI C P VTQE S E KE AS WQRAS AI PE T SNS E S S S QAT T P FLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACT TCT T TCGTATGGAGAAAGGAAGTATAAAATGGAAT TATAAACATCACCGGCTCTCTGTGAA 
£ TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
|X; AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
Ifl GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
W GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
^ ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

H 

I ; GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
t GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
13 GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
Iff CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
S CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
ff'i TCGGCAAGGCCAGCAGCGGGCGT GGGAGATACGGGCATAGAGGAT GCAAACGC TGTCAGGGGT TC 

AGCC T CT CAGGGGCCCCTGACT GAACCT T GGGCAGAAGACAGT CCCC CAGACCAGCC TC CC CCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGA GAAACT GCAGAGACTCACCCTGAT TGAGGGATCACAGCCCCT CCAGGCAAGGGAGAAGTCA 
GAGGCT GAT TCT TGTAGAAT T AACAGCCC T CAACGT GAT GAGC TAT GAT AACACT AT GAAT TAT G 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
T CCCT T T TAT T T T T T TAAC TAAAAGACAGACAAAT TCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFFMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide : 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-37 0 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 

CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCT TCCTGCTGTT TC 

T CATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACT TACT TCAAGGAAT GGACCTGT 

TCTTCGTCTC CATC TC TGCCCAGAAGCT GCAAGGAAATCAAAGACGAAT GT CC TAGTGCAT T T GA 

TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 

GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 

GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 

CAACTACAACACCT TTGGATCT GCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 

ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 

AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 

GT T T GGC AT CTACC AGAAATATC CAG T GAAATATGGAGAAGGAAAGT GT T GGAC T GACAACGGCC 

CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 

GGCCAGCGGGAATTCACT GCGGGAT T TGT T CAGT TCAGGG TAT T TAAT AAC GAGAGAGCAGCCAA 

CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

GAT AC T T T CCAGAGGCCAGTCCC CAGCAGT GT GGAGAT T TT TCTGGTT T TGAT TGGAGT GGATAT 

GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 

T TGAGAGT T T TGTGGGAGGGAACCCAGACC T C T CCTCCCAACCATGAGAT CCCAAGGATGGAGAA 

CAACTTACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGA 

AAAAAA 



FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
Y QT FCDMTS GGGGWTL VAS VHENDMRGKC T VGDRWS S QQG S KADYPE GDGNWAN YNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYS PYGQRE FTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHC I GGGGY FPEAS PQQCGD FSGFDWS GYGTHVGYS S SRE I TEAAVLL FYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCT TTAT TCGTCACCGTCTTTGGGC TGAAAAAGAAACCCT TCTGAT TACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
T CT T GAGT C TGGGAT TAT CCGCATT G T AT TTAGTGC T TT GTAAT AAAAT AT GT TT TGTAGTAACA 
T TAAGAC T T ATATACAGT T T TAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLI S I GAHCSASVALS FFI FERWECTTYWYI FVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 



C T G G GAC C C C GAAAAGAGAAG G G GAG AG C G AGG GGAC GAGAGC G GAG G AGG AAGATGC AAC T GAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGT TCCACCAGGAACAGCAGAT CTTCAT CGAAGCCAAGGCCTCCAAAATCT TCAACTGCCGGATG 
ff| GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTGGCTTTGCACCCACGACCCAGCCAAGATCTG 
|j[J CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 

N TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 

jL. ! 

F t CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 

\J 

TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

Q AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 

|fl AGAAAGGGTCCCAAG T GC TGGT CCCAACCT GAAGCT GTGGAGTGACTAGAT CACAGGAGCAC TGG 

|*l AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 

?I TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 

O 

Hi" CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GC ACACCCAT CC T TAAGC TAAGAC AGGAC GAT T GTGGTCC TCCCACAC TAAGGC GACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE QS 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACT TCCCAAACCGCAGACTACATCTT TAGAGGAAGCACAACTGTGCCT 
T TT T CT GAAAAT CCC T T T T T C T GGTGGAAT TGAGAAAGAAATAAAACT AT GCAGATA 
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FIGURE Q4 

MTAAVFFGCAFI AFGPALALYVFT I AIE PLRI I FL I AGAFFWLVSLL I S S LVWFMARVT I DNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AAT T T T T CACCAGAGTAAACT T GAGAAACCAACTGGACC T TGAGTAT TGTACATT T TGCC TCGT G 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACT ACCAAACCAACAGCAGTGAAAT CAGGTC TTTCCTTCTT TAAGT CTGATACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAAT TT T T GT CACACAAC T T GGAGCCCAGGGCAC TATCCT AAGCTCAGAGGAAT TGCCACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
63 T AAT CCAGAT GTCCAGGAT GGAAGC C T TCCAGCAGGAGGAGCAGGT GT AAATCC TGC CAC CCAGG 

GAACC CCAGCAGGC CGC C T CC CAACT CCCAGTGGCACAGATGACGACT TT GCAG TGACCACCCC T 
G CAGGCAT CCAAAGGAGC ACAC AT GC CAT C GAGGAAGC CAC C AC AGAATC AGCAAAT GGAAT TC A 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
h*k\ AT T GAT TAT AT TAT GGAATAGAT TGAGACACAT TGGATAGTCT T AGAAGAAATTAAT TCT T AAT T 

T AC C TGAAAAT AT TCT TGAAAT T T C AGAAAAT ATGT TC TATGT AGAGAATC C C AACT T T TAAAAA 
JL, CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 

S TAAAACAT AT T TGGAAAAC TGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

¥3 

m AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

W 



p 
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FIGURE q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQ I FT S L 1 1 HS L FPGG I L PT S QAGANPDVQDGSL PAGGAGVNPAT QGT PAGRL PT P S G 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 



m 



i 



Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGT GCT T G TGCACGT GAAGGAT GAGAAT GACCAGG TGCCCCAT T TCT CTCAAGC CAT C TAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
^ CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
g CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CI CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
||i CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
y T GCAGAGGGAAACC TC T ACG T GACCAGAGAGCTGGACAGAGAAGCC CAGGCT GAGTACCTGC TCC 

Zi AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
^ ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
H CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
%l ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
K CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 

CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
VI ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
0 CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
gi GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
-J| GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
**f AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
IM ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTGCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGGCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCGATGGGAGTGCTGCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE Q8 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 

STSLDHALERT YQLLVQVKDMGDQASGHQATATVEVS I IES TWVSLE P IHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQN I LLLVLAMDLAGAEGGFS S TCE VEVAVTDINDHAPE FI TSQI G 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVWWQS VAKLVGPGPGPGATATVTVLVERVMPP PKLDQE S YEASVP I SAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGbTYTVLVEAQDTALTLAPVPS 
QYLGTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REHIIPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI 
LI FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide : 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGATT C T TC CCAGGAACACAAACGT AGGAGACCCACGC TCCT GGAAGCACCAGCC T TTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT C T AGGACCCAGGCATC T TGC TT T CCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAATG 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAAC TC T GAGT T CCAT ACAACC T CCAGTGGGAT CAGCACAGCGACCAACTC TGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAAC TCTGAGT CCAGCACACCC TCCAGT GGGGCCAGCACAGTCACCAACT 
Ijl CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
pi AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAAC T CTGACT CCAGCACAACC T CCAGT GGGGC TAGCACAGCCACCAACT C TGAGTCCAGCACAA 
B CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
H? GCCACCAAC TC TGAGTCCAGCACAACC TCCAG T GGGGCCAGCACAGCCACCAAC T CT GAGT CCAG 

§y AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
I/} GCACAGC CACCAACT C TGAC TGCAGCACAGTGT CCAGTGGGGCCAGCACT GCCACCAAC TC TGAG 

8 "? TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
H GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
S| CTGAGTC CAGCACAGT GT C CAGT GGGAT CAGCACAG TCACCAAT T CT GAG T CCAGCACACCC T C C 

p AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
f** CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
S CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
PI GCC ACCAACT C T GAC T CCAGCACAACCTCCAG TGAGGCCAGCACAGC CACCAAC TCT GAGTC TAG 

P CACAGTGT CCAGT GGGATCAGCACAGT CAC CAAT TC TGAGT CCAGCACAACC TCCAGT GGGGCCA 

g| ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
'fit ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
jj{ GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 

TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G TGGAGT CC TAAC TGGT TCT GGAGGAGAC CAGTATCATCGATAGCCAT GGAGAT GAGCGGGAGGA 
ACAGCGGGCCC TGA GCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCT TGAAGAAGGTATTCC TCACCT T TCT TGC C T T TACCAGACACT GGAAAGAGAATACTATAT 
T GC T CAT T T AGC T AAGAAAT AAAT ACATC T CAT C TAAC AC AC AC GAC AAAGAGAAGC TGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CT GGCAT TCAAAATC TC CACAGTAAAAT CCAAAGACC TCAAAAAAAAAAAAAAAAAAAftAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGAS TATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAG TATNS E S S TVS S G I S T VTNSES S T PS S GANTATNS E S S TT S S GANTATNS ESS TVS S GAS TA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTS SEAS TATNSESSTVS SGI STVTNSES 

N ? S T T S S GANTATNS GS S VT SAGS GT AAL TGMH T T S HS AS T AVS E AKPGG S L VP WE I FL I TLVS WA 

AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

III AMEMSGRNSGP 

Signal peptide: 

f'! amino acids 1-20 

?H Transmembrane domain: 

|ff amino acids 510-532 

%i? ■ 

| 

Ftl 
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FIGURE lOl 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGC AG T T TC TAGC T AC T GCACGTACT T CAT AAAGCAGGAC T C TAAAAGC T T TGGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 

jN CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

rasas. 

M GAT T AT AAC T GGGGAT GCAT T T G T T C C T GG AGAAAGAAG T G T CAT TAT CAT G AAC CAT C G GAC AA 

5f GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 

£H 

I , i TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

%V 

%l C T T CAT TCAT AGGAAAT GGAAGGAT GACAAGAGC CAT T T C GAAGACAT GAT T GAT TAC T T T TGT G 

M. AT AT T CACGAACCAC T TCAAC TCCT CATAT T CCC AGAAGGGACT GAT C TC ACAGAAAACAGCAAG 

^- T CT C GAAGT AAT GCATT TGCT GAAAAAAAT GGAC TT CAGAAATATGAATATGT T T TACATCCAAG 

1, AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 

iff AT AT CAC TGT GGCGTATCC TCACAACAT T CCT CAAT CAGAGAAGCACC TCC T CCAAGGAGAC T T T 

p CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

111 CCT T C AACT CT GG T GCCACAAAC GGT GGGAAGAGAAAGAAGAGAGGC TGCGTTCCTTC TATCAAG 

W GGGAGAAGAAT T T T TAT T T TAC C GGACAGAGT GT CAT T CCACCT TGCAAGTC TGAAC T CAGGGT C 

II 

CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT T TG TACAGTCT TGT TAAGT GGTAT T T TAT AAT CACCAT TGTAAT CT T T GTGCTGCAAGAGA 
GAATATT TGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACT T T TACACAAACAGCCACAT 
TTAAATT CAAAGAAAAATGAGTAAGATTATAAGGTT TGCCATGT GAAAACCTAGAGCATATTTTG 
GAAATGT TCTAAACC TTTCTAAGCT CAGAT GCAT TT TTGCATGACTATGTCGAATATT TCTTACT 
GCCAT CAT TAT T TGT TAAAGATATT T T GCACT TAAT TT TG TGGGAAAAATAT TGC TACAAT T TT T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATATTAT TAAACAATCATCAGGCT TT TAAA 
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FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYS YLRLEKI CLKAS LKGVPG FGWAMQAAAY I F I HRKWKDDKS H FEDM I D Y FCD I HE P 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFSPAMCLLI YLYSLVKWYFI I TIVI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACC TC TACAGAGAT GGGGAAGAC T GGGAAT C TAAGCAGAT GCCACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
y s AGGAT T T GTCT TCAGAC TCCAGAGCAAATGCAGATGGGTAGAGCCT G TAT GATGT GGAGAT C TCC 

f<« AT TATAGT CCAGGAAAAT GC TGGGAGCATATT GT GT TCCAT CCACCT T GCT GAGCAGAGTCATGA 

GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
W TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
111 AAAT CCAAAGGGAAAATCCAGGC GGAACT GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

||j AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
il T T T CT GAT C TGAAAAC TGTAAC CCAT AGAAAAGC TC CCCAGGAGG T GCC T CAC T C TGAGAAGAGA 

T T T ACAAGGAAGAG TGTGGTGGC TTCTCAGGGT T TC CAAGCAGGGAGACAT TAC TGGGAGGT GGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
p ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
gj GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
S TGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
JJ| GGGACTCCCATATTCATATGTCCAGTGTCCTGGGG ATGAG ACAGAGAAGACCCTGCTTAAAGGGC 
Kl CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
fi(J CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
CI TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 

TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGATTCCTGCCTCACAGGTGAAGAT TAAAGAGACAACGAAT GT GAATCATGCTTGCAGGT T 
T GAGGGCACAGT GT T T GC TAAT GAT GT GT T TT TATAT TATACAT T TTCCCACCATAAAC TC TGT T 
TGCTTAT TCCACAT TAATT TACT TT TCTCTATACCAAATCACCCATGGAATAGTTATTGAACACC 
T GCT T TG TGAGGCT CAAAGAATAAAGAGGAGGTAGGAT T T T TCAC TGAT T CTATAAGC C CAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CTCATTAACACAGACACAAAAATTCTAAATAAAATTTTAACAAATTAAACTAAACAATATATTTA 
AAGAT GATATATAAC TAC T CAGT GT GGTT T GTCCCACAAAT GCAGAGT T GGT T TAAT AT T TAAAT 
ATCAACCAGTGTAAT T CAGCACAT TAATAAAG TAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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MAFVLI LVLS F YELVSGQWQVT GPGKFVQALVGE DAVFS C S L FPE T SAEAME VRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGWMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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CCT T C AC AGGAC TCT T C ATTGCTGGT TGGC AATGAT GT AT CGGC CAGATGT GGT GAGGGC T AGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCGCTGATTGTCCTGGCAGTGTGCATTGGA 
CT C ACT GT T CAT T ATGTGAG AT AT AAT C AAAAG AAG ACCT AC AATT ACTAT AGC ACATT GTC AT TT AC AAC 
T GAC AAACT AT AT GCT GAGT T T GGCAGAGAGGCTT C TAAC AATT T T AC AGAAAT GAGCC AGAG ACT T GAAT 
CAAT GGTG AAAAAT GC AT TT T ATAAATCT C CATT AAGG GAAGAAT TTGTC AAGT CT C AGGTT AT CAAGTT C 
AGTC AACAGAAGCAT GGAGTGT TG GCTC AT AT GC T GTT GATT TGT AGATT TCACTCT ACT GAGG AT C C TGA 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATC C T CACTC AGTT AAAAT T AAAAAAAT C AAC AAGAC AGAAACAGAC AGC TATCTAAACC AT TGCT GC GG A 
H AC AC GAAGAAGT AAAACT CT AGGT C AGAGT CT CAGGAT CGT TGGTGGG AC AGAAGT AGAAGAGGGT GAAT G 

GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 

pj 

5; TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA 
8 a J AT AAAACCT T CGAAAATG AAACGGGGT C TCCGGAGAAT AATT GT CC AT GAAAAATACAAACACCC ATC AC A 

%l T GACT ATG AT AT TT CT CT TGC AGAGC TT TC T AGC CC TGTT CCCT AC AC AAAT GC AG T ACATAGAGT TT GT C 

yi TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
% J G ATGGT T ACAGT CAAAAT C ATCTT CG AC AAGCAC AGGT GACT CT C AT AGACGCT AC AACT TGCAAT GAACC 

g TC AAGC TT AC AAT GAC GCCAT AAC TCCT AG AATGT T AT GT GCTGGCT C CT TAGAAGG AAAAACAGATGC AT 

tl GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 

AGCT GGGGAGATGAAT GT GCGAAACCCAAC AAGC CT GGT GTT TAT ACT AGAG TT ACGGCC TT GCGGGACTG 
GATT AC TTCAAAAACT GGTATC TAA GAGAC AAAAGC CT CAT G G AAC AG AT AAC ATT TTTTTTTGTTTTTTG 
GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCT GT CAT C TGT GAGCAAT AGTT GAAACT TT AT GT AC AT AGAGAAAT AG AT AATAC AAT AT T AC 
AT T ACAGCCT GT AT TC AT T TGT TCT CTAG AAGTT TT GT CAGAAT TT TGACTT GT T GAC AT AAAT T T GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
TCAAGGTGCAGAACAAGGAGTGAAAGAAAATATAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAA 
AAGT AT T AGGTGTT TT TCT TAGTG GAAT AT T AGAAAT GAT CAT AT T CATT AT GAAAGGTC AAGC AAAGAC A 
GC AGAAT ACC AAT C ACTT CATC AT T T AGGAAGT ATGGGAACT AAGT T AAGGAAGTCCAGAAAGAAGCC AAG 
AT AT AT CCT TAT T T T C AT TT CC AAACAACT ACT ATG AT AAAT GTGAAG AAGATT CT GTT T TT T TGT GAC CT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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MMYRPDWRARKRVCWE PWVI GLVIFISL I VLAVC I GLT VH YVRYNQKKT YNYYS TLS FT T DKLY 
• AE FGRE ASNNFTEMS QRLE SMVKNAFYKS PLREE FVKS QVI KFSQQKHGVLAHMLL I CRFHS TE D 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVTIKPSKMKRGL 
RRI IVHEKYKHPSHDYDI SLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTL I DAT TCNE PQAYNDAI TPRMLCAGS LEGKTDACQGDS GGPLVS SDARD I W YLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTGTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
AT AAT G T T C T C AC T T TAAC T G C AG AAAC G AG GG TAGAGGAAGC AG T C AT T T T G AC T T AC T T T C C T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCAC TTTGAAAGCCAGGAT GACAAATTATGGAT TACCTAGATATCGGTGGC TTAGTCA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
AAAT G AC AGAGAT GGACT GGCC CC C AGAT TCCTGCTGTGT T AGAGAAT T C C C AGGAT G T T CC AAA 
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT 
O TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
m TCCTGGCCATGATTCTCAC CAT TACT CTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 

ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT 
GTTGAAACCAAGCC TGTCAAGAATCT TT GAACACACATCCATGGCAAACAGCT T TAATACACACT 
T TGAGAT GGAGGAGT TATAAAAAGAAAT GT CACAGAAGAAAACCACAAACT TGT T T TAT TGGACT 
T GT GAAT T T T T GAGTACATAC TAT G T GT T T CAGAAATAT G TAGAAAT AAAAATGT TGCCATAAAA 
%l TAACACC TAAGCATAT AC TAT T C TAT GCT T TAAAATGAGGATGGAAAAGT T TCATGTCATAAGT C 

ACCAC CT GGACAATAAT T GATGCCC T TAAAAT GC TGAAGACAGATGT CATACCCACTGTGTAGCC 
* TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
W T CCAT GCAAACGAGTCACATAT GGT GGGACTGGAGC CAT AG TAAAGGTTGATT TACT TCTACCAA 

111 C TAG T AT AT AAAGT AC T AAT T AAAT G C TAAC AT AG GAAG T T AGAAAAT AC T AATAAC T T T TAT T A 

C| CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT 
m ACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT 
5j TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
y TTCAGTTCTGATAATGTTAAGAATAACCATTATGAAAAGGAAAATTTGTCCTGTATAGCATCATT 
fll ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

TGT TAT T T AAAT AC T TAAC C AC T AAT T T T G AAAAT T AC C AG TGT GAT AC AT AGGAAT CAT TAT T C 
AGAAT GTAGT C TGGT CT T TAGGAAGTAT T AATAAGAAAAT T TGCACATAAC T TAGTT GATT CAGA 
AAGGAC T T GT AT GCT GT T T T T CT CCCAAAT GAAGAC TC T T T T T GACAC T AAACAC T T T T T AAAAA 
• GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA 
TAGT GT T CT T T T TCT CCAGAAAAAT GCTT GT GAGAAT CAT TAAAACATGTGACAAT T TAGAGAT T 
CTTTGTTT TAT T TCACTGAT TAATATACT GTGGCAAATTACACAGAT TAT T AAAT T T T T TTACAA 
GAGT AT AG TATAT T TAT T TGAAATGGGAAAAGT GCAT TT TACT G TAT T T TGTGTAT T T T GT T TAT 
T T CT CAGAATATGGAAAGAAAAT T AAAAT GT GT C AAT AAAT AT T T T C TAGAGAG TAA 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWmSISVIAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPWHP 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FC VE LACGVWT YEQE LMVPVQWS DMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLS DLYQEGCGKKMYS FLRGTKQLQVLRFLG I S I GVTQ I LAMI LT I TLLWAL YYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 



53 » 

Jzt Transmembrane domains : 



amino acids 12-35, 57-86, 94-114, 226-248 



APP_ID=10063717 



Page 262 of 322 



FIGURE 1QQ 



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGG ATG CCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCTGCTCAGATACCAGC GAGTGCACATGCGCTCCAGGAACT GTGCTAGGCCCTGACAGGCAGACAT GCTTTG 
AT GAAAAT GAAT GT GAGC AAAAC AACGGT GG CT GC AGT GAGAT CTGT GT GAACC T C AAAAACT C C T AC C GCT GT 
GAGTGTGGGGT T GGCCGTGT GCT AAGAAGT GATGGCAAGACTT GTGAAGAC GTTGAAGGATGCCACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTT TCCACGCCT GTACACCATTTCT GAAGGATACGTTCCCAACCTTCGAAACTCCCCACT GGAAAT 
CAT GAGCC GAAAT C AT GGG AT CT T CCC ATT C ACT C T GGAGAT C T TCAAGGAC AAT GAGT T T GAAGAGC C T T AC C 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAGAAGC T GG GT AT AAT AT T T C AAGT T AC AAACCC T AGAAAAAT T AAAC AGTT ACT GAAAT TAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAAT TTGGAAGT GTAT C AAT AAAAC AGT AT AT AAT T T T 



FIGURE no 



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
T FCI PENHCGTHAP WLNGSHPLEGDGI VQRQACAS FNGNCCLWNTTVEVKACPGGYY VY RLTKPS VC FH V 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLECLRDSLYFGIEPWHV 
SGLE SL VESC FATPT SKI DEVLKY YL IRDGCVSDDS VKQ YTS RDHLAKH FQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGP I RI DWED 

H 

J! Important features of the protein: 

lail Signal peptide: 

* 

I A amino acids 1-16 

■ ■ 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 
%«i Tyrosine kinase phosphorylation sites. 

II! 

ammo acids 411-418, 443-451 

CI 

N-myristoylation sites, 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AGACCT TC AG AAAGGT GGGG AT CCCC AT C ATC AT AGCACT ACTG AGC.CT GGCGAGT AT C ATC AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AGC AT C C AGT ACGAC AAACAGC AC GT C T GT GG AG GGAGC ATC CT GGAC CCCC ACT GGG TCCT C ACGGC A 
GCCC ACT GCT TCAGGAAACAT ACCGATGT GTT C AAC TGGAAGGT GCGGGC AGGC TCAG AC AAACT GGGC AG 
CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAZ^AGACAATGACA 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GAT GAGGAGC TC ACT CCAGCCACCCC ACT C TGGAT C AT TGGATGGGGCTT T ACG AAGC AG AATGGAGG GAA 
GATGTC TG AC AT ACTGCT GC AG GCGTCAGT CC AGGT CAT T GAC AGC AC AC GGTGC AATGC AGACGAT GCGT 
ACCAGGGGGAAGT C AC CG AGAAGATGATGT GT GC AGGC AT CC CGGAAG GGGGTGTGGAC ACCTGCC AG GGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 



III CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG AT CCCCCAAAGT C AGACAC AGAGC AAGAGTCCCC T TGGGT AC ACCC CTCT GCCC AC AGCCTC AG CAT 
T TCT TGGAGC AG CAAAGGGCCT C AAT TC CT GT AAGAGACCCT CGC AGCCC AG AGGCGCCC AGAGGAAG TC A 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGG GGT ATT GC TAAGCC AAGAAGGAACTT TC CC AC AC TACT GAATGG AAGC AGGCT GT C TT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC T AGAGC AAGAAACC AGT T GT AAT AT AAAAT GC ACTGCC CT AC TGTTGGTATG AC TACCGT T ACCT 
ACT G TTGTC ATT GT TATT AC AGCT AT GGCC AC T ATT AT T AAAGAGCT GTGT AAC AT CT CT GGCAAAAAAAA 
AAAA 
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MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPI I IALLSLASI I IWVLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTGTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTGACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
|, ? : s CTTGCCCTGGCCGTAGAAGGGAT TGA CAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
CI AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
|;f GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
7 S J ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
%j C AGAAGGGGT C TGC T TAGACCACCTGGT TT AT GTGACAGGACT T GCAT T CT CC T GGAACATGAGG 

fait G AAC GC C GGAGGAAAGC AAAGT GGC AGGGAAGGAAC T T G TGCC AAAT TAT GGGT CAGAAAAGAT G 

N GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
*L ?s CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
S CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
f*l TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
PJ CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
Cl GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
■ " i? GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAAT TACTTGAACC TGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAA7!JyiTAZ\AAAAAGAATTA 
TGGTTATTTGTAA 
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MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACC ATG GCAA 
AGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C T ACAGCAATGGAGAGAAGAAGAAG AT T TACAT GGAAAT TGAT CCT GT GACCAGAAC T GAAAT A 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
C T G AAC C AG AAG AGGAAAT AG AT G AG AAT G AAG AAAT T AC C AC AAC T T T C T T T G AAC AG T C AG T G 
M ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
l~l GATTTGT GATAACGTGACCATGTAT TGGATCAATCCCACTCTAATAT CAGT TTCTGAGTTACAAG 

ACT T T GAGGAGGAGGGAGAAGATCT TCAC T T TCC T GCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGT GGG TGGTCCCTCAAG TGAAAG T AGAGAAGACCCG T CACGCCAGACAAGCAAGTGAGGA 



N s AGAACT T C CAATAAATGAC TAT AC T GAAAATGGAATAGAATTT GAT CC CAT GC TGGATGAGAGAG 

%l GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

% GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 

Hp T7UVCTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 

P TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 

CjjJ GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 

*J TTATCACCAAAAAAAAAAAAAAAAAAA 

IT: I 
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MAKNPPENCEDCHILNAEAFKSKKIC 

HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FSEPEEE I DENEE I TTT FFEQSVI WVPAEKP IENRDFLKNSKI LE I CDNVTMYWINPTLI SVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 
amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
T ATTTC ACCATC CT GGGACT T CCAGC C AT GCT GCAGGC AGTGCG AGC CCT GAT GAT C GT AGGCATCGT CC T 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
N GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
i: \ TGGGT CC AAC AC CAAAAAC AAGAAGAT AT ACGATGG AGGT GCCC GCAC AG AGGACGAGGT ACAATC T TAT C 

CTTCCAAGCACGACTATGTG TAA TGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
%i CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
g GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
0 TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
Ul TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 

GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
AC ACTT ACT GAAGAAGAAGC AATAAGAG AAAG AT AT TT GT AATC TCTCCAGCCC AT GAT C TCGGT T TT CT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGT CAGAAATT GTCCCTAGATGAATG AG AAAATT AT TT TT TTTAAT TTAAGT C CT AAAT ATAGTTAA 
AAT AAAT AAT GT T TTAGT AAAAT GAT AC AC TAT CTC TGT G AAAT AGCCTCACCC CTACATGT GGAT AG AAG 
GAAATGAAAAAAT AATTGCT TTGACATT GTCT ATAT GGTACTTTGTAAAGTCATGCTTAAGT ACAAATTCC 
AT GAAAAGCT CAC AC C TGT AAT CC TAGCACTTT GGGAGGCTG AGGAGG AAGG AT CACTTG AGCCC AGAAGT 
TCGAGACT AG CC TGGG CAAC AT GG AG AAGC CCT GTC TCT AC AAAAT AC AG AGAGAAAAAATCAGCC AG TC A 
TGGT GGC AT ACACCT G TAGT CCC AGC AT T C CGGG AGGCTGAG GT GGGAGGATCACT TG AGCCCAGGGAGGT 
T GGGGCT GC AGT GAGC CAT GAT CAC ACCAC TGC ACT CCAGCC AGGT GACAT AGC GAGAT C CT GTCT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 



m 
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MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPTyyiLQAVRAmiVGIVLGAIGLLVSIFALKCIRIGSMEDSAKAKMTLTSGIMFIVSGL 
CAI AGVSVFANMLVTNFWMS TANMYT GMGGMVQ TVQTRYT FGAALFVG WAGGLTL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains : 

amino acids 81-100, 121-141, 173-194 
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GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATA ATGG CAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
j. , CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
m ACT CAATAGTGAAT GT T GCCCAAAAACGT GAGC TT GGAGAAGC TCTCTACT TAGGATGGACCACGGCA 

T*\ CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
|f! CAGATACT CGAT ACCT T CCCATC GCAC AACCCAAAAAAGTTAT CACACCGGAAAGAAGT CACCGAGCG 

IU TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 

^ ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 

5 s *!? 

^ , AAT CT TAATTACAGGAACT GT GCATCAGCTAT T TATGAT TC TAT AAGC TATTT CAGC AGAATGAGATA 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
p| CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
HI ACG AC AT AGCAT TAT GT AC AT AG AT GAG T GT AACAT T TAT AT C T C AC AT AG AG AC AT GC T TAT AT GG T 
Q T T TAT T T AAAAT GAAAT GC C AGT C CAT T AC AC T GAAT AAAT AGAACT CAAC TAT T GC T T T T C AGG GAA 

tf.l ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
y TTTAT AAT GAAGATT AAAAT GAAGGCT TTAAT CAGCATTGTAAAGGAAATT GAATGGCT TT CTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
T TC C AG G G C TAT ACT C AGAAGAAAGAT AAAAG T GT GATC T AAGAAAAAGT G AT GGT T T TAGGAAAGT G 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 
AT AT T T TAATAAGTAT TTGAGTACAGACT TT GAGGT T TC AT CAAT ATAAAT AAAAGAGCAGAAAAAT A 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATAT TAATAAAT TGTACATT TTT CTAAT T 
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mTHALE I AGLFLGGVGMVGTVAVTVM^ 

IYDSLLALSPDLQAARGLMCAASVMS FLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MWLIPVSWVANAI IRDFYNS I VNVAQKRE LGEAL YLGWT TALVL I VGGAL FC C VFCCNE KS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGGC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAAT GT CT GAGGGAAAGC T T TGAGGAGT CCT GGACACCCAACTAC 
§^ AAGC AG T GT T CATGGAGT T CAT T GAAT TAT GGCATAGATC T T GGGAAAAT T GCGGAG T GT AC AT T 

Cl TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
|!j GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
Zl AT T GAAGCTATAAT T TAT T TGGACCAAGGAAGCCC T GAAAT GAAT T CAACAAT TAATAT TCATC G 

m 

%\ CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
Ik T TGGCAC T TGT T CAGAT TACCCAAAAGGAGATGC T T C TACTGGAT GGAAT T CAG T T T C TCGCAT C 

% l ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 

T T G GAAT G G T T C AC T T AAAT GAC AT T T T AAAT AAG T T TAT GTAT AC AT C T GAAT GAAAAGC AAAG 
||l C TAAATAT GTT TACAGACCAAAGTGTGAT TT CACACTGT T T TT AAAT CT AGCAT TAT T CAT T T T G 

*** CTT CAAT CAAAAGTGG T T TCAAT AT TT TT TT TAG T T GGT TAGAATACT T T CT TCATAGT CACAT T 

pj CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAAT AT AAAAGCTAC CAAT C T T T GTAC AAT T TG T AAAT GT T AAGAAT T T T T T T TAT AT C T GT 
T AAAT AAAAAT TAT T T C CAACA 



ru 
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MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGG ATG GGTTTCAATG 
TAAT TAGGCT ACTGAGCGGAT CAGC T GTAGCAC TGGT TATAGC CCCCACT GTGT T AC T GACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
AT AAC AG C C T T C AAAAAC T T AAG T AT AAT C AAT T T AAAGGG C T C AAC C AG C T CAC C T G GC TAT AC 
CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA 
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACT GC CGCAACC T GGAAC TT T T GGACC T GGGATATAACCGGATCC GAAGT TTAGC CAGGAAT G 
T CT TT GCT GGCATGATCAGACT CAAAGAAC TT CACC T GGAGCACAAT CAAT T T T CCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
C AT AGGACAGACCAT GTCC TGGACC T GGAGCTCC TTACAAAGGC TT GAT T TATCAGGCAAT GAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
T CC AAC AAG C T C ACAT T TAT TGGT C AAGAGAT T T T GGAT T C T T G GAT AT C C CT C AAT GAC AT C AG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAG TCCC TAAAGCAAATGAC T CCCAGCACCCAGGAAT T T TAT GT A 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAAT CGGGCT CCAGGGAGT GT GAGG T ATGAACCAT TGTGAT AAAAAGAGC T CT TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT 
GAAC TCCGGT T TAATATAATACC TAT T GT ATAAGACCCT T TAC T GAT TCCAT TAATGTCGCAT T T 
GT T T TAAGATAAAAC T T CT TTCATAGG TAAAAAAAAAAA 
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a 

m 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS I CGKS T TERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

%i amino acids 420-442 

PI 

f| N-glycosylation sites. 

fl| amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGC AACC T T T CCCC TGGAT CT CACAAAAAC T CGAC TCCAAAT GCAAGGAG AAGC AGC T C T TGC T C 
GGT TGGGAGACGGT GCAAGAGAAT C TGCC CCC TATAGGGGAAT GGT GCGCACAGCCC TAGGGAT C 
ATTGAAGAGGAAGGC TT TC TAAAGCTTTGGCAAGGAGTGACACCCGCCATT TACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAAT CCAAC TGACCTAGT GAAGGT T CAGAT G C AAAT GGAAG GAAAAAGGAAAC TGGAAGG 
AAAACCAT T GCGAT T TC GT GGT GT ACAT CATGCAT T T GC AAAAAT C T TAGC T GAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTT T TGTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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MSVPSEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRK^ 

NI QRAALYNMGDLTT YDTVKHYLVLNTPLEDNIMTHGLS SLCSGLVAS I LGTPADVTKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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SI 

t 



CGCGGATCGGACCCAAG-CAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATCGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
C7\AATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAAT GTGGACGATGGGCACATCAT TAACATCAATAGCAT GT CT GGCCACCGAGT GT T A 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 



■%t AGACACAAT T CGCCT TCAAAG T CCACGACAAGGAC CC TGAGAAGGCAGCT GCCAC CT ATGAGCAA 

jU ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATT GGAGACATCCAGATGAGGCCCACGGAGCAGGTGACC TAGT GACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTGATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
s|| CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYPGTLI 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNV1WLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
I S PGWET Q FAFKLHDKDPEKAAAT YEQMKCLKPEDVAEAVI YVLS TPAH I Q IGD I QMRP TEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase. 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
• CCGTTGAGTGCTTGAA ATGAG GAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAAT TT TTAAC TGATAGTTGTACATATT TGGGGGTACATGTGATATT TGGATACATGTATACAA 
i ; ~L TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATTTTTTATTCTTT 
CI TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
0 CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 

m 

f . GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 

U! 

■ • % = CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCGAAAGTGTTATGATTACA 

GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
Sj ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGT AAGAAAAT GCAATAT TTGTCTTTCTGTGCC TGGC T TAT TT CAC T T AAC AT AATGAC T TCC T G 
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 



Important features of* the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

J!f amino acids 86-92 

m . 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

\L amino acids 68-79 



is; 
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FIGURE 131 



huh 



m 

ril 



T T CT GAAGT AAC GGAAGCT ACC TT GT AT AAAGAC CT CAAC ACTGC TGACC ATGATC AGCGCAGC CT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TTGAAAAACTTGCTGAAAGT AGAAAGAAT ATACCT ATACCAC AACAGT TT AGAT GAATTTCCTACCAACCT 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAAT TC CCT ATC TGGAAG AATT AC AT TT AG AT GAG AAC T CTGTC T CTGCAGT T AGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCC C AGGACT AT AG AAGAAC T AC GCT TGGAT G AT AATC GC AT AT CC AC TATTT C ATC ACC AT CTCT TC AAG 
GT CTCACT AGTCT AAAACGCCT GGTT CT AGAT GGAAAC C T GT TGAACAAT CAT GGT TT AGGT G ACAAAGT T 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT 
CTT ATC TAAGGC AGCT CT ATCG ACT GGAT ATGT C CAAT AATAAC CT AAGT AATT TACCT C AGGGTATC TT T 
GATG AT TT GGAC AAT ATAAC AC AACT GATT CT T C GC AAC AAT CC CT GGT ATTGCGGGT GC AAG ATGAAAT G 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 
ACCATTCAGATAACCACT GCAAT ACCC AAC AC AGTGT AT CCT GC CC AAGG AC AG TGGCC AGCT CCAGT GAC 
CAAAC AGCC AGAT AT T AAGAACCCC AAGCT C ACT AAGG ATCAACAAACC AC AGG GAGTCC CTC AAGAAAAA 
CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
AAC AGGGG AACGC AGT GAGT ACTT GGTC AC AGCC CT GGAGCC TG ATTC AC CCT ATAAAGT AT GC AT GGTTC 
CC AT GGAAAC CAGC AACC TCTACCT AT T TGAT GAAACT CCTG TTTGT ATT GAGACT GAAACT GC AC CCCTT 
C G AATGT AC AACCCT ACAACCACCCTC AAT CGAG AGC AAG AG AAAGAACC T T AC AAAAACCC CAAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AATGGATC GC TCTTCT C AAG GAACT GTGCAT AT AGC AAAGGG AGGAGAAGAAAG GATG ACT AT 
GC AGAAGCTGGC ACT AAGAAGGAC AAC T CT AT CC T GGAAATCAGGGAAAC TTCTTT T C AG ATGTT ACCAAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
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FIGURE 122 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVWRGmCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET 
M IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
P PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 

Q 

ileiretsfqmlpisnepiskeefvihtifppngmnlyknnhsesssnrsyrdsgipdsdhshs 

W 

%j Important features of the protein: 

M Signal peptide: 

amino acids 1-28 

Transmembrane domain: 
$h amino acids 531-552 

Q N-glycosylation sites . 

Ill amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 

f 1 ' 

Jjw Tyrosine kinase phosphorylation site. 

m 

amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 123 



0 

. bait 

w 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTGAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCACT CCAACAGAGCCAGGAC TATATCAACC TCT T C T GCGCCAACATGAT GGACT TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTGCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGATGCT T CGGGGAGCCT GAT GC TGAAGATGAAGAAT TATC TAAAGC TAT TCAA 
TATCAGCAGCAT T TT TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT TT CCAGAT TC TC GCTC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAG GTGTGAGCCACCGTGTCTG 
GC T GAAAAGCAC T T T CAAAGAGAC T GT GT T GAAT AAAGGGCCAAGGT T C T T GCC ACCCAGCAC T C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLSQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAI QRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
Y INL FCANMMDLNRRAEAI GYAYPTRD I FMEN IMFCGMGGFS D FYKLRWLEAI L S WQKQQE GC FG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites . 

amino acids 10-16, 20-26, 63-69, 203-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1 . 

amino acids 230-237 
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FIGURE 13s 



GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCT TACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAA.T GAGAGCAAGATATG 
T GAAT T AC AT C AAAAC AT C AGAG G T T G T C AG AC T G C C C TAT C C T C T C C AAAT GAAAT C T T C AGG T 
jL^ CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
Q TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 

tl C TGACAT GAGACGGGAAATGGAGCAGT CAATGAATAT GC T GAAT TCCAACCATGAGT T GCC T GAT 

fit 

jf* GT T T C T GAGTTCAT GACAAGAC TCT T CT CT TCAAAAT CATC TGGCAAAT CTAGCAGCGGCAGCAG 

Js TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
Sj GAG T CAT CCCGACG T TGAT CTCTTACAAC T GT GTAT G TT 

* AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 

GAGGTCAATATTGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCATTAATAAATTAT 
m ATGAACTACTATACATTATGTATAT TAAT TAAAACATCT TAATCCAGAAATCAAAAAAAAAAAAA 

S| AAAAAAAAAAAAAAA 
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FIGURE 1*6 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAWPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFmNPMVMMMVLPLLIFVLLPKVWTSDPDM 
Q SMNMLNS NHE LPDVS E FMT RL FS S KS S GKS S S G S S KT GKS GAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 



3 amino acids 37-40, 236-239 



01 

fi 

fi I 

m amino acids 151-154 



hi cAMP- and cGMP -dependent protein kinase phosphorylation site. 



N-myristoylation sites . 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 127 

GATGGCGCAGCCAGAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACT CGCACT CAGAC TC T GAAC T CAGACC TGAAAT CT TCT CT T CACGGGAGGC T TGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
J*;* AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
G T GC AAG C C AAAG AT GGAAACAT T GAC AT C AGAAT C T T AAG GAG G AC T GAG T C T T T GC AAG AC AC A 

O AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
J*j AAACTACCAGACCCC TGACCAT TAT AC TC T CCGGAAGATCAGCAGCC TCGC CAAT TCC T TT CT TA 

Ci CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
hh AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
S'l T T T GGGGGAAC TAGACAT TCT TC TG CAAT GGAT GGAGGAGACAGAATAGGAGGAAAGTGATGCTG 

2^ CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
?J{ ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
H CT T CC T T GCATGAT T G TC T T TAT GCAT C CCCAATC T TAAT T GAGACC ATAC T TG TAT AAGAT T T T 

|0 T GTAATATC TT TC TGC TAT TGGATATAT T TAT TAGT TAATATAT T TAT T TAT T T T T T GCTAT T TA 

CI ATGTATT TATT TT TT TACT TGGACATGAAACT TTAAAAAAATTCACAGATTATATTTATAACCTG 

s *? ACT AGAGCAGGTGATG TAT TT T T AT ACAGTAAAAAAAAAAAACCT T GTAAAT TC TAGAAGAGTGG 

CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATT TGAAAT TGAACCAATGACTACT TAGGATGGGTTGTGGAATAAGT TTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
C CAGGAATCCTACACGGCCAGCATG TATT T CTACAAATAAAGT T TT C T T TGCATACCAAAAAAAA 
AAAAAAAAAAA 



APP ID=10063717 



Page 291 of 322 



FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
S SRE AWQ FFLLLWS PDFRPKMKAS S LAFSLLSAAFYLLWTP S TGLKTLNLGSCVIATNLQE I RNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
S LANS FLT I KKDLRLSHAHMTCHCGEEAMKKYS QI L SHFEKLE PQAAWKALGELDI LLQWMEET 
E 



Important features of the protein: 

s . Signal peptide: 

if* 8 ! amino acids 1-42 

H 

il- cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

til 

,7'; ammo acids 192-195, 225-228 



HJ N-myristoylation sites. 

* amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTGGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGT GTGAAAT T TAT GCCCTACACAAC T TACC T T GT GGAAAAAGGAGCATC TCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACGTCAATTCTGCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGC TGGAAAAAGAAT AGTCT T T TAT GGAGAT GAAACCTGGGT T AAAT TA 
TTCCCAAAGCAT TT TGTGGAATATGATGGAACAACCTCAT TTTT CGTGTCAGAT TACACAGAGGT 
a f GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 

TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
C| AAGCT GAGC GAGAT GGACAGCGT GCT GAT GAAGATCCACACCT CACT GCAG T CGAAGGAGAGAGA 

O GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
ffi GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
T I CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC 
jf.f ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
H CAATGAGAGAGCAG T T GAGATT T T TACAT T TGAATACAG TGCAGC T TAGTAAAC TGT TGCAAGAG 

M AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
\J GAACTGGAT CAGAC TGTACTTGGAGGAAAAGCAT TCAGAAGTCCTAT TCAACCTGGGC TCCAAGG 

TTGTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
^ TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
fel CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
II! CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGGAGGCT 
Q GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
SI- AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 

TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
fcjf CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
flj TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGGCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAGTATG T AGT TACCAAAAGAATAAACGGCAATAAT T GAGAAAAAAAA 
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FIGURE 1AO 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNS PALLEDS VI RQAKAAGKRI VFYGDE TWVKL FPKH FVS YDGT TS FFVS DYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMS E TGS HGAS STEEVNTPL I L I S S AFERKPGDI RHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

If! Transmembrane domain: 

&l amino acids 58-7 6 

iiy N-glycosylation sites. 

j amino acids 56-60, 194-198 

N-myristoylation sites. 

|: amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 

Pi 275-281, 278-284 

HI 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGC C T TCCAGGT TAT CG T GACGCACC TT GAAAG TC T GAGAGC TAC TGCC C TACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCC.CTAT AGAAAACAAC TGC CAGCAC C T T AAGACCAC T CACACC T T C AGAGT GAAGAAC T T AAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATT TGAGAACAGGAAACACAT TGAATT TTCATT TCAACCAGT TTGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 



pi : 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS SLSSASAEKGSP ILLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWmLESAAHPGWFICTSCNC3STEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP -dependent protein kinase phosphorylation site, 

amino acids 33-3 6 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 



CTAGAGAGTATAGGGCAGAAGGAT G GCAGAT GAGT GAC T CCACATCCAGAGCT GCCTCCCTT TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTT.GCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
& GGGGAGAACTAGGTGTGGGAAAT TT GAAGACGACAT T GACAACT GCCATT T C CAAGAAAGCACAG 

£[ AGCTGAACAATACT T TCACCTGCT T CT T CACCAT CAGC AC CAGGCCCTGGAT GAC TCAGT TCAGC 

g| CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
lilj GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
NJ TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
f . ACACATCTT GAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCAT TT TAAGAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAAAAA^ 

|| 

Pi 

Hi " 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMT Q FSLLNKTCLEGFH 

Important features of" the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 



APPJD=1 00637 17 



Page 298 



FIGURE 14* 



CTGT GCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTT CT GACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAAT CAAGT GGAACCGGAAGGCCC TGCC CAGCAC TGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGC T AGAAAGTGGCAGAGCCAGG AT T CAAC CC TGGC T TGT CTAAC C CCAGG TTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C TCTAAT GAAATTGT GAAAGC T C CATGT T TAGAAATAAATGAAAACAC CT GA 
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FIGURE 146 

MRKHLS WWWLATVCMLL FS HLSAVQTRG I KHRI KWNRKALPS TAQI TEAQVAENRPGAFI KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 

CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 

CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 

AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 

CAACT TTCAGAT TCAGGGGGTACATGTGAAGGT TTGT TT TATGAGTATATTGC ATGA TGCTGAGG . 

TTTGGGGT 



O 



W 



m 
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FIGURE 14.8 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGT GCAGAAAGACCTTT GAAT GAACAGAT TGCT GAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
TGTTGATAACTTGAACCTGCTAAAGGeAATAACAGAAAA.GGAAAAAATTGAGAAAGAAAGACAATGTA 
^ TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG 
'P. ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA 
G| T CAACTAGACGGGACTCCTTTAACCGC TGAAGACATT GT CCATAAAATCGCT GCCAGGATT TATGAAG 

U] AAAA.T GACAGAGC CGTGTTT GACAAGATT GTTT CTAAAC TACTTAATCTCGGCCTTATCACAGAAAGC 

fi CAAGCACAT ACACTGGAAGAT GAAGTAGCAGAGGTT TTACAAAAATTAATCT CAAAGGAAGCCAACAA 

TTAT GAGGAGGAT CCCAATAAGCCCACAAGCT GGACT GAGAAT CAGGCTGGAAAAAI ACCAGAGAAAG 
^ I TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 
g TTAACCT T GACAAATGGCTT GGAAAGGAGAACTAAAACCTACAGT GAAGACAACT T T GAGGAACT CCA 

P AT AT TTCCCAAATTTCTATGCGCTACT GAAAA.GTAT T GATT CAGAAAAAGAAGCAAAAGAGAAAGAAA. 

Ill CACT GAT TACTAT CAT GAAAACACTGATT GAC TTTGT GAAGAT GAT GGT GAAATATGGAACAATATCT 

M CCAGAAGAAGGT GTTTCCTACCTT GAAAACTT GGAT GAAAT GATT GC TCTTCAGACCAAAAACAAGCT 

. AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
CAGACAGT ACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATATGGAAGCTT GAAGGAT T CCACAAAA 
GAT GATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCC TAT TTGGAAGC 
CAT CAGAAAAAAT ATT GAAT GGTT GAAGAAACATGACAAAAAGGGAAATAAA.GAAGATTATGACCTTT 
CAAAGAT GAGAGACTT CAT CAATAAACAAGCT GATGCTTAT GT GGAGAAAGGCATCCTT GACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAA7VAAAAAAAAAA 
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FIGURE iso 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKIEKERQS I RSS PLDNKLNVEDVDSTKNRKLIDDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVS-KLLNLGLITESQAHTLEDEVAE 
VLQKL I SKEANNYEEDPNKPT SWTENQAGKI PEKVT PMAAI QDGLAKGENDE TVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAI RKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAAT AACCAGC T T C TAGC TGGAGGGC T GCAT GCAGGGAAGGTCAT TAAAGGT GAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGT7\AAAACCATGTGGGGTAAA 
lZ CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
M TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
CI GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
ff| GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
M TGCTACCTTT.CCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 



III 



GAAAT GGCTCGAGC TCAGAAGATAAAAGATAAGTAGGGT ATGC T GAT CCT CT TT TAAAAACCCAA 
GATACT^T CAAAAT CCCAGATGC TGGT CT CT AT T CCCATGAAAAAGT GCT CATGACATAT T GAGA 
AGAC C TAC T T ACAAAGTGGCATATAT T GCAAT T TAT T T TAAT TAAAAGAT ACCTATT TATATAT T 
TCTTTATAG7UU\AAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TC C T T GTATAAAAATAAGAAAAGAAAT TAAT C T TGAGG TAAGCAGAGCAGACATCAT C TC TGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
T T GTAGTAGTGATCAGGAAACAGAT C T CAGCAAAGC CACT GAGGAGGAGGCT GT GC TGAGT T T G T 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GAT CACAGCCCC TGGGAT T CCAAGGCATT GGATCCAG T CT C T AAGAAGGCT GC T GTAC TGGT T GA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GAC TGGT T T CC T T GTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGAC TATGTA 
AAGAT GAAGGCAGAGATCGGAG T T T T GCAGCCACAAGC TAAGAAACACCAAGGAT T GT GGCAACC 
ATCAGAAGC T T GGAAGAGGCAAAGAAGAAT TCT T CC C TAGAGGCT TTAGAGGGATAACGGCT CTG 
CTGAAACCT TAATCTCAGAC TT CCAG CCT C CTGAAC GAAGAAAGAATAAAT T TCGGCT GT T T TAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
T AG AAT AT G G C AG AAG T GAT G G CAT G C C AC T T C C AAGAT TAG G T T AT AAAAG AC AC T G C AGC T T C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGC TAGC T GCCAT GC TATGAGCAGGCCTATAAAGAGAC T TACG T GGTAAAAAAT GAAG TC TCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGT T GCT CAGT T T TGG T C TAAC T TG TTAT GCAGCAATAGATAAATAAT AT GCAGAGAAAGAG 
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FIGURE 152 

MVLSGALC FRMKDSALKVLYLHNNQ'LLAGGLHAGKVIKGEE I SWPNRWLDAS LS PVI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites . 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 



APP ID-10063717 



Page 306 



FIGURE 1*3 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATGG CCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
GCCTGCATT TGAC CAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
CAATT AGATGCCCCAAAGC GAT T T T T TT TAACCAAAAGGAAGAT GGGAAGCCAAACTCCATCAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
AGACCAGAAGGTAGACTT TCTAAGCATAGATATT TATTGATAACATT TCAT TGTAACTGGTGTTC 
T AT AC AC AG AAAACAAT T TAT T T T T T AAAT AAT TGTCTTTT T CCAT AAAAAAGAT T AC T T T C CAT 
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 

s? 

gj TGTATTTATTATTATTATAAGACTGCATTTTATTTATATCATTTTATTAATATGGATTTATTTAT 
||! AGAAACAT CAT TCGATAT TGC TACT T GAGT GTAAGGC T AAT AT T GATATTT ATGACAATAAT TAT 

AGAGC T ATAACAT GT T TAT T TGACCT CAAT AAACAC T T GGATAT CCC 
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FIGURE 154 

MAALQKS VS S FLMGTLAT SCLLLLALLVQGGAAAP I S SHCRLDKSNFQQPY I TNRT FMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites . 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGT CAGT GCC CGAC T TGTGAC TGAGT GT GCAG TGCC CAGCATGT ACCAGG TCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTGTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
; a | GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
14 ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
%l CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 
* CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 

GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
PiJ GC CCAGCACAGGCAC T T T C TAGATAT TTCCCCCTTGC TGGAGAAGAAAGAGCCC C TGGT TT TAT T 

fll TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
C| T T GAC AT G GAT GAT T C T GAG GAGGAAG CTGTTATT GAAT G T AT AG AGAT T TAT C CAAAT AAAT AT 

fU CTTTATTTAAAAATGAAAAA 



hi 
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FIGURE 1*6 



MRERPRLGEDSSLISLFLQVVAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 13 6-140 

Tyrosine kinase phosphorylation site. 



%| amino acids 127-135 

N-myristoylation sites. 

mi amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACCT CCGAGT AGAACCT GT TACAAC TAGT GT T GCAACAGGGGACTAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
C C C T T CC AT GT C T G T GAAT T T CAC C T CAC C AGGC T G C C T AGAC CACAT AAT GAAAT AT AAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCGGAACATCAGTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
C AT GGGTGCTGGTGGCAGGGATCTATCTAAT GTGGAGGCACGAAAGGATCAAGAAGACT TCCT T T 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
T CACACAAT T TGT TAG T T CACT GAAT T T C T T CAAAACCAT T GCAGAAGT GAGG TCAT CC T T GAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GT GATCTAAGAAGC CAGAT T CAT CT GCACAAATACGTGGTGGT CT AC T T TAGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAG7VACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 1*8 



m - 287 



fij 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWIX&QHDL^^^ 

LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH 
ST I IGFSQVFEPHQKKQTRASWI PVTGDSEGAT VQLT PY FPTCGSDC I RHKGT WLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQI HLHKY VWY FRE IDTKDDYNAL SVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



AGCCACCAGCGCAACATCACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
AC T C AGC T GAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLS ILGLAFLSEAAARKI'PKVGHTFFQKPESCPPVPGGSMKLDIGI INEN 
QRVSMSRNI E S RS T S PWNYTVT WDPNRYPSE WQAQCRNLGC I NAQGKED I SMNS VP I QQE TLW 
RRKHQGCSVS FQLEKVLVTVGCTCVTPVIHHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC TCCG TGCTGGAAGTAGGAGGAGAGT CAGGAC T CC CAGG 
ACAGAGAG T GCACAAAC TACCCAGCACAGC CCC CTCC GCCCCCT CT GGAGGCTGAAGAGGGAT TC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
L] ? AACAGCTCGGAGAAGGTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
%J CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 

GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
L CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CI CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
U| CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
£*$ CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 

CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
3 CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
II GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGGCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 



m 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLEEILVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLL TL Q S WLLD APC S L P AE AALC W RAPGG DPCQ PL V P P L S W EN VT VD KVLE FP LL KGH P NL C VQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY I HKRWALWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSEIRELSAQGPVAWFHAQRRQTLQEGGWVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
L P SQL P D FLG ALQQ PRAP RS GRLQ E RAE Q V S RAL QP ALD S Y F H P PGT P AP GRGVGP G AG P GAGDGT 



HI Signal sequence: 

y| amino acids 1-20 

fek Transmembrane domain. 

amino acids 453-475 



N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
ffi 334-337, 357-360, 391-394 



ffj Glycosaminoglycan attachment site. 

amino acids 583-58 6 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. \ 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



■si;? 

m 



GGGAGGGCTCTGTGCCAGCCCCGATCAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATGTTCCATGACCTG 
T T CTACCAC T TAGAGCTCCAGG T CAACCGCACC TACCAAAT GCACCT T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
M CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
Q TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
"1 AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
T CCACAGCGGCATAGCC TGTCCGAGAT CACCTAC T TAGGGCAGCCAGACATCTCCATCC TCCAGC 
CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCGTCCCT 
CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
CCCT GGGGAT T TGCACAGACAGAACAT CT GACCCAAATGTGCT ACACAGTG GGGAGGAAGGGACA 
CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
m CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
5f TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
V* . GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

PI AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
£| ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
fil AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGT GCAAGGAGGAAAT GCAGGGAAAC TCCCGAGGT C CAGAGCCC CAC C T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTT TCTGCAGGCAGGAGT T TCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCT G TAATGTGGGGATCATAACAC CT AC C T CAT GGAGTTGTGG T GAAGAT GAAAT GAAGTCAT G 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
E FLGT IMI CVP T WAKE S AP YMCRVKT L PDRTWTYS FS GAFL FSMG FLVAVLC YL S YRYVTKP PAP 
PNS LNVQRVLT FQPLRF I QEHVL I PVFDLS GP S SLAQP VQYSQ I RVSGPRE PAGAPQRHS LSE I T 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
H SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

m 

'IKS if 

fl 

Signal sequence. 

g r ; amino acids 1-17 

fix : 



Hv Transmembrane domain. 

amino acids 233-250 

85' 

f| 

ifi N-glycosylation sites. 

II s 

Q. amino acids 80-83, 87-90, 172-175 

i 

«*» N-myristoylation sites. 

ni 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGT TGAACACAGT T T TGGATAT T T T CCAAAAGAT T TGATCAAGG TACT T CATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT 
T T AATAGT TATAATGTAGAAGAGCT T T TAGGAT CT TTGGAACT GGAGGACTCTGTACCTGAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCT GCGGCTAACGCTCAGGGAGTGCAGT CT TCGT TGGACAC TTT TGAAGAAAT TC TGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAfiAGATTGTTTT TAG TA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA. 
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FIGURE 166 



MAAAPGLLFWL.FVLGALWWVP^ 

FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 

FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCC T T GAGAAT GAAGCAGAACAGAAGT T AATAGAGAGCAT GT TGCAAAACC TGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
T GCCT GCCCAGAT CTC TACCAGT GGT CT GAT GGAAGCAAT TCCCAGTACCGAAAC TGG TACACAG 
U ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 

CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCT^CAATTATATTTG 
5! CAAGTATGAACCAGAGAT TAATCCAACAGCC CC TGTAGAAAAGC C T TATCT T ACAAAT CAAC CAG 

y GAGACACC CAT CAGAATG TGGT T GT TACT GAAGCAGGTATAAT TCCCAAT CTAAT T TATGT TG T T 

if* ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
hi GC AT AAAAGTAAAGGAAGAACAAAAAC TAGT CCAAACCAGT CTACAC TGTGGAT T TCAAAGAGTA 

Cf CCAGAAAAGAAAGT GGCATGGAAGT ATAA TAAC TCAT T GAC TT GGT T CCAGAAT T T TGTAAT T CT 

^ GGAT CTGTATAAGGAATGGCAT CAGAACAAT AGC TT GGAAT GGCT TGAAAT CACAAAGGATC TGC 

H AAGAT GAAC TG TAAGC TCCC CC T TGAGGCAAATATTAAAGTAAT T T T TATATGTC TAT TAT T T CA 

H TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
r ACT T CAAAC T TCAAGCAAATGAAAT GGACAAI GCAGATAAA3T T G T TAT CAACACG TCGGGAGTA 

gl TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
t% TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
f} ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
O AAACTTAT TACTGT TGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTT TGT 

111 TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
pi AATAAGAAGCTATT TCAT TAAGT GT GATATAAACC TCC TCAAACAT T T TAC T TAGAGGCAAGGAT 

T GT C TAAT T TCAAT T G TGCAAGACAT GT GC C T TATAAT TAT TT T TAG C T TAAAAT TAAACAGAT T 
T TGTAATAATGTAACT TTGT TAATAGGTGCATAAACACTAATGCAGTCAAT TTGAACAAAAGAAG 
T GACATACACAATATAAATCATATGTCTTCACACGT TGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TAT AT T AGTATACAAAGAGGT CAT GTGGT TGAGACCAGGTGAATAG TCAC TAT CAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAA.GGAAC TACGAAAT CGT GT GAAAAT GGGT TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAA.GTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAMAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
.GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNWVTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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